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Abstract 

We show the existence of regular combinatorial objects which previously were not known 
to exist. Specifically, for a wide range of the underlying parameters, we show the existence of 
non-trivial orthogonal arrays, i-designs, and i-wise permutations. In all cases, the sizes of the 
objects are optimal up to polynomial overhead. The proof of existence is probabilistic. We show 
that a randomly chosen structure has the required properties with positive yet tiny probability. 
Our method allows also to give rather precise estimates on the number of objects of a given size 
and this is applied to count the number of orthogonal arrays, i-designs and regular hypergraphs. 
The main technical ingredient is a special local central limit theorem for suitable lattice random 
walks with finitely many steps. 
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1 Introduction 



We introduce a new framework for establishing the existence of regular combinatorial structures, 
such as orthogonal arrays, t-designs and t-wise permutations. Let i? be a finite set and let y be a 
vector space of functions from B to the rational numbers Q. We study when there is a small subset 
T C B satisfying 

= for all/ in l^. (1) 

In probabilistic terminology, equation ([T]) means that if t is a uniformly random element in T and 
6 is a uniformly random element in B then 

EteT[f{t)]=^beB[f{b)] for all/ in y, (2) 

where E denotes expectation. Of course, ([T]) holds trivially when T = B. Our goal is to find 
conditions on B and V that yield a small subset T that satisfies ([T]), where in our situations, small 
will mean polynomial in the dimension of V. We remark that in many natural problems one might 
encounter a vector space V over M or C instead. However, since is a rational equation, we can 
always reduce to the case of rational vector spaces. 

A more concrete realization of the above framework is given by the following problem. Let (j) 
be a matrix with rational entries whose rows are indexed by a finite set B and whose columns are 
indexed by a finite set A. When is there a small subset T C B such that the average of the rows 
indexed by T equals the average of all rows? This problem is a special case of the above framework 
with V being the subspace spanned by the columns of (j). In fact, the general framework can always 
be reduced to such a problem by choosing a basis {(j)a), a £ A, of V and defining the matrix (p by 
4>{b,a) = (paib). 

Our main theorem. Theorem 12.41 gives sufficient conditions for the existence of a small subset 
T satisfying ([1]). A second theorem. Theorem 12.51 provides sharp estimates on the number of such 
subsets of a given size. We apply the theorems to establish results in three interesting cases of the 
general framework: orthogonal arrays, t-designs, and t-wise permutations. These are defined and 
discussed in more detail in the next sections. Our methods solve an open problem, whether there 
exist non-trivial t-wise permutations for every t. They strengthen Teirlinck's theorem [Tei87| . which 
was the first theorem to show the existence of i-designs for every t. And they improve existence 
results for orthogonal arrays, when the size of the alphabet is divisible by many distinct primes. 
Moreover, in all three cases considered, we show the existence of a structure whose size is optimal 
up to polynomial overhead. In addition, we provide sharp estimates for the number of orthogonal 
arrays and t-designs of a given size. As a special case, these yield estimates for the number of 
regular hyper graphs of a given degree. 

Our approach to the problem is via probabilistic arguments. In essence, we prove that a random 
subset of B satisfies equation ([T]) with positive, albeit tiny, probability. Thus our method is one 
of the few known methods for showing existence of rare objects. This class includes such other 
methods as the Lovasz local lemma |EL75| and Spencer's "six deviations suffice" method |Spe85| . 
However, our method does not rely on these previous approaches. Instead, our technical ingredient 
is a special version of the (multi-dimensional) local central limit theorem with finitely many steps. 
We cannot use any "off the shelf" local central limit theorem, not even one enhanced by a Berry- 
Esseen-type estimate of the rate of convergence, since the number of steps of our random walk 
is small compared to the dimension of the space in which it takes its values. Instead, we prove 
the local central limit theorem that we need directly using Fourier analysis. Section 11.41 gives an 
overview of our approach. 
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We also mention that efficient randomized algorithm versions of the Lovasz local lemma |Mos09t 
IMTlOj and Spencer's method [BanlOt ILM12] have recently been found. Relative to these new 
algorithms, the objects that they produce are no longer rare. Our method is the only one that we 
know that shows the existence of rare combinatorial structures, which are still rare relative to any 
known, efficient, randomized algorithm. 



1.1 Orthogonal arrays 

Here and in the rest of the paper we use the notation [m] := {1, . . . ,m}. A subset T C [q]^ is 
an orthogonal array of alphabet size q, length n and strength t if it yields all strings of length t 
with equal frequency if restricted to any t coordinates. In other words, for any distinct indices 
ii, . . . ,it £ [n] and any (not necessarily distinct) values vi, . . . ,Vt G [q], 

\{x £T : Xi^ = vi, . . . ,Xi^ = vt}\ = q~*\T\. (3) 

Equivalently, choosing x = G T uniformly, the distribution of each coordinate of x 

is uniform in [q] and every t coordinates of x are independent (x is t-wise independent). For an 
introduction to orthogonal arrays see |HSS99j . 

Orthogonal arrays fit into our general framework as follows. We take B to be [g]" and V to be 
the space spanned by all functions of the form 



f{I,v){xi,. . . ,Xn) 



1 Xi = Vi for all i G / 
Otherwise 



with / C [n] a subset of size t and v G [qY . With this choice, a subset T C B satisfying ([T]) is 
precisely an orthogonal array of alphabet size q, length n and strength t. 

t /2 

It is well known that if T C [qY^ is an orthogonal array of strength t then |r| > (£2!i)^/^ for some 
universal constant c > (see, e.g., [Rao73| ) . Matching constructions of size \T\ < q^^ {jY'''^ are 
known, however, as these rely on finite field properties, the constant Cq generally tends to infinity 
with the number of distinct prime factors of q. Our technique provides the first upper bound on 
the size of orthogonal arrays in which the constant in the exponent is independent of q. Here and 
below, a universal constant is a constant independent of all other parameters. 

Theorem 1.1 (Existence of orthogonal arrays). For all integers q>2,n>\ and 1 < t < n there 
exists an orthogonal array T of alphabet size q, length n and strength t satisfying \T\ < (^) for 
some universal constant c > 0. 

Moreover, we provide a rather precise count of the number of orthogonal arrays with given 
parameters. Such a count appears to be new even in the most well-studied case when q = 2. 

Theorem 1.2 (Number of orthogonal arrays). There exists a constant c > such that for all 
integers q > 2, n > 1 and 1 < t < n and for all N satisfying that N is a multiple of g* and 
min(A^, g" — N) > (^) , we have that the number of orthogonal arrays T of alphabet size q, 
length n, strength t and |T| = equals 



(27rp(l - p))5 ELo {lY<i-^YpN(i _ p)q"-N 

/ cqn \ '^^ 

where p := and \6\ < , 



i^ + S) 
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As a final remark we note that any orthogonal array T of alphabet size q and strength t must 
satisfy that |T| is a multiple of by ([3]). Hence the essential restriction on N in the last theorem 
is only that N be bounded away from and q"". 

1.2 Designs 

A (simple) t-{v, k, A) design, or t-design for short, is a family of distinct subsets of [v], where each 
set is of size k, such that each t elements belong to exactly A sets. In other words, denoting by 
[^] the family of all subsets of [v] of size k, a set T C [^] is a t-design if for any distinct elements 

\{sGT:ii,...,ites}\=X. (4) 
It follows that A satisfies the relation 

For an introduction to combinatorial designs see |CD07j . 

Our general framework includes f-designs as follows. We take B to be [^] and V to be the space 
spanned by all functions of the form 



fa{b) 



1 aCb 

Otherwise 



with o G [^] . With this choice, a subset T C B satisfying ([TJ is precisely a simple t-{v, k, A) design, 
with A given by ([5]). 

Although t-designs have been investigated for many years, the basic question of existence of 
a design for a given set of parameters t,v,k and A remains mostly unanswered unless t is quite 
small. The case t = 2 is known as a block design and much more is known about it than for larger 
t. Explicit constructions of t-designs for t > 3 are known for various specific constant settings of 
the parameters (e.g. 5-(12, 6, 1) design). The breakthrough result of Teirlinck |Tei87j was the first 
to establish the existence of non-trivial t-designs for t > 7. In Teirlinck's construction, k = t + 1 
and V satisfies congruences that grow very quickly as a function of t. Other sporadic and infinite 
examples have been found since then (see |CD07| or |Mag09| and the references within), however, 
the set of parameters which they cover is still very sparse. 

It follows from ([5]) that any t-(v,k,X) design T has size |r| = A(^)/(^) > (|) . Moreover, it 

can be shown |RCW75] that whenever v > k + t the inequality |r| > (|^(/2j) — (l)'^*^^^ holds. Even 
when existence has been shown, the designs obtained are often inefficient in the sense that their size 
is much larger than these lower bounds permit. One of the main results of our work is to establish 
the existence of efficient t-designs for a wide range of parameters. 

Theorem 1.3 (Existence of t-designs) . For all integers v > 1, 1 <t < v and t < k < v there exists 
a t-{v,k,X) design whose size is at most {^Y^ for some universal constant c > 0. 

Our work also provides a rather precise count of the number of t-designs of a given size with 
given parameters. To state this count precisely, we recall the well-known observation that if T is a 
t-{v, k, A) design then for every 1 < s < t, each subset of size s in [v] is covered by exactly 

\ \S/ \rp\ 



5 



sets in T. In particular, (A^), 1 < s < t must be integers. Our next theorem, in addition to 
estimating the number of designs, imphes that if |r| (or equivalently A) is sufficiently large, these 
integrality conditions suffice for the existence of t-{v,k, A) designs. 

Theorem 1.4 (Number of t-designs). There exists a constant c > such that for all integers v > 1, 
1 < t < V and t < k < V and for all N satisfying that the numbers 

(') 

-Tjjr^ are integers for I < s <t 

\s) 

and satisfying that min(A^, (^) — N) > {^)^^ , we have that the number of t-{v,k, \) designs T of 
size |T| = equals 



n 



(to 
/v—t—s\ 
{ k-t ) 



(O-UO)^ 
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where p := 7^, \6\ < , and ( is defined to be 0. 

(fe) ' ' - ^min(iV,(-)-7V) ^"^^ 

1.2.1 Regular hypergraphs 

A (simple) k-uniform hypergraph on n vertices is a family of sets of size k (called edges) on n 
elements (called vertices). A hypergraph is d-regular if each vertex belongs to exactly d edges. It is 
straightforward to check that d-regular, /c-uniform hypergraphs are the same as l-{n,k,d) designs. 

The existence question for d-regular, fc-uniform hypergraphs is quite simple, such a hypergraph 
exists if and only if nd is divisible by k, and in this case any such hypergraph has exactly ^ edges. 
However, counting the number of d-regular, /c-uniform hypergraphs is a non-trivial problem which 
has received much attention in the literature, mainly in the graph case (k=2). See, e.g., the paper 
by Wormald and McKay jMW90j and references within. In the graph case, approximate counts are 
known when the graphs are either somewhat sparse, or very dense. 

Since d-regular, fe-uniform hypergraphs are a special case of t-designs, we may translate The- 
orem 11.41 to obtain a count of such hypergraphs. Our result applies for k and d sufficiently large 
and appears to be new. 

Theorem 1.5. There exists a constant c > such that for all integers n>2, \ <k<n and 
1 < d < (^Z]^) satisfying that nd is divisible by k and min(^, (^) — ^) > n"^, we have that the 
number of d-regular, k-uniform hypergraphs on n vertices equals 




where p := and \6\ < , 

We mention also a related result on the asymptotic formula for the number of binary contingency 
tables obtained in [CM05j . A binary contingency table is an x n matrix with entries in {0, 1}, all 
row sums equal to k and all column sums equal to d (so that necessarily N = Such a matrix 
describes a d-regular, A;-uniform hypergraph on n vertices with labelled edges and allowing multiple 
edges. Equivalently, it describes a bipartite graph with N vertices of degree k on one side and n 
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vertices of degree d on the other side. When the parameters (A^, n, k, d) are such that most binary 
contingency tables have all rows distinct, the number of such tables is close to A^! times the number 
of d-regular, fc-uniform hypergraphs on n vertices. Thus our result can probably be used to extend 
the asymptotic formula of [CMOS] to additional cases. We do not develop this direction here. 

1.3 Permutations 

A family of permutations T C 5„, is called a t-wise permutation if its action on any i-tuple of 
elements is uniform. In other words, for any distinct elements ii,. . . ,it £ [n] and distinct elements 

|{7r G T : 7r(ii) = ji, . . . ,7r{it) = jt}\ = — — 7— TtI^I- (6) 

n(n — Ij • • • (n — I + Ij 

Our general framework includes t-wise permutations as follows. We take B = Sn and V to be 
the space spanned by all functions of the form 



1 b{h) = ji,---,b{it) = jt 
Otherwise 



where i = {ii, . . . ,it) and j = (ji, . . . ,jt) are t-tuples of distinct elements in [n]. With this choice, 
a subset T C B satisfying ([T]) is precisely a t-wise permutation. 

Equation ([6]) yields a lower bound on the size of t-wise permutations, \T\ > n{n—l)--- (n — 1 + 1). 
Constructions of families of t-wise permutations matching this lower bound are known for t = 1,2, 3: 
the group of cyclic shifts x 1— )• x + a modulo n is a 1-wise permutation; the group of invertible affine 
transformations x 1— )• ax + b over a finite field F yields a 2-wise permutation; and the group of 
Mobius transformations x 1— t- (ax + b)/{cx + d) with ad — he = 1 over the projective line F U {cxd} 
yields a 3-wise permutation. However, it is known (see, e.g., |Cam95| . Theorem 5.2) that for n > 25 
and t > 4 there are no subgroups of Sn which form a t-wise permutation (of any size) , other than 
Sn itself and the alternating group An- Moreover, for t > 4 (and n large enough), no non-trivial 
constructions of t-wise permutations are known at all |KNR05t lALll) . with the exception of the 
recent work jFPY12j which constructs rather large, but non-trivial, t-wise permutations of size 
^2n £qj, infinitely many values of n and t. One of our main results is the existence of small t-wise 
permutations for all n and t. 

Theorem 1.6 (Existence of t-wise permutations). For all integers n > 1 and 1 < t < n there exists 
a t-wise permutation T C Sn satisfying \T\ < {cnY^ for some universal constant c > 0. 

We leave the problem of estimating the number of t-wise permutations of a given size for future 
work. See Section 13.51 for numerical calculations related to this problem. 

1.4 Proof overview 

The idea of our approach is as follows. Let us first consider the following slight simplification of our 
main idea. Let T be a random multiset of B of some fixed size chosen by sampling B uniformly 
and independently N times (with replacement). Let {4>a)aeA be a basis of integer- valued functions 
for V (where A is some arbitrary finite index set). Observe that T satisfies ([T]) if and only if 



teT ' ' be-B 



.teT 



for all a in A, (7) 
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where we add terms multiple times if they appear in T multiple times. Thus, defining an integer- 
valued random variable 



and X := {Xa)aeA £ we see that existence of a multiset of size satisfying ([T]) will follow if 
we can show that ¥[X = K[X]] > 0. To this end we examine more closely the distribution of X. 
Let ti, . . . ,tN be the random elements chosen in forming T. The basis {(j)a)a<^A defines a mapping 
(j) : B hy the trivial 



may be viewed as the end position of an A^-step random walk in the lattice Z . Thus we may 
hope that if is sufficiently large, then X has an approximately (multi-dimensional) Gaussian 
distribution by the central limit theorem. If the relevant local central limit theorem holds as well, 
then the probability ¥[X = x] also satisfies a Gaussian approximation. In particular, since a 
(non-degenerate) Gaussian always has positive density at its expectation, we could conclude that 
¥[X = ^[X]] > as desired. Moreover, to estimate the number of multisets of size satisfying 
([1]) we need only estimate F[X = E[X]] using the Gaussian approximation. 

The above description is the essence of our approach. The main obstacle is, of course, pointed 
out in the last step. We must control the rate of convergence of the local central limit theorem well 
enough so that the convergence error does not outweigh the probability density of the Gaussian 
distribution at E[X]. Recall that the order of magnitude of such a density is typically c"'"^' for 
some constant c > 1, and recall that \A\ is the dimension of V, which is the main parameter of 
our problem. So we indeed have very small probabilities. For this reason, and because we want 
convergence when A^ is only polynomial in the dimension of V, we were unable to use any standard 
local central limit theorem. Instead, we develop an ad hoc version using Fourier analysis. 

In our proof of the main theorem, we modify the above description in one respect. It is tech- 
nically more convenient to work with a slightly different probability model. Instead of choosing T 
as above, we set p := N/\B\ and define T by taking each element of B into T independently with 
probability p. This has the benefit of guaranteeing that T is a proper set instead of a multiset. 
However, it has also the disadvantage that it does not guarantee that \T\ = N. To remedy this, we 
assume that the space V contains the constant function /i = 1; or if not, we can add it to V at the 
minor cost of increasing the dimension of ^ by 1. With this assumption, since 



we see that ([7]), or equivalently X = IE[X], implies both that \T\ = N and that ([T]) holds. Another 
disadvantage is that in this new probability model, the vector X is no longer a sum of identically 
distributed variables. However, since the summands in ([8]) are still independent, we can continue 
to use Fourier analysis methods in our proof. 

We cannot expect there to always be a small subset T C B that satisfies ([1]). For instance, Alon 
and Vu [AV97| found a regular hypergraph with n vertices and ~ n"/^ edges, with no non-trivial 
regular sub-hypergraph. Here, a regular hypergraph is one in which every vertex belongs to the 
same number of hyperedges. We may describe their example in our language by letting B be the 




E Y,h{t) =E[\T\] = N., 
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set of hyperedges of this hypergraph, A be its vertex set, and define (f) : B ^ {0, 1}^ by letting (f>{b) 
be the indicator function of the set of vertices contained in b. The result of [AV97| implies that 
while the vector YlbeB *^(^) constant, this property is not shared by X^teT '?^(*) ™y non-empty, 
proper subset T C B. Thus, we need to impose certain conditions on B and V, or equivalently on 
the map (p. 

We will require certain divisibility, boundedness and symmetry assumptions. Our main theorem 
shows that these conditions suffice to yield the existence of a small solution of ([T]) and, moreover, 
a rather precise estimate for the number of solutions. The conditions and the statement of the 
theorem appear in Section [2j The existence and counting theorems for orthogonal arrays, i-designs 
and i-wise permutations follow by verifying these conditions for the choice of B and V detailed in 
Sections 11.11 through 11.31 

1.5 Related work 

In the probabilistic formulation ([2|) of our problem we seek a small subset T C B such that the 
uniform distribution over T simulates the uniform distribution over B with regards to certain tests. 
There are two ways to relax the problem to make its solution easier, and raise new questions 
regarding explicit solutions. 

One relaxation is to allow a set T with a non-uniform distribution ^ which simulates the 
uniform distribution over B. For many practical applications of t-designs and t-wise permutations 
in statistics and computer science, but not quite every application, this relaxation is as good as the 
uniform question. The existence of a solution with small support is guaranteed by Caratheodory's 
theorem, using the fact that the constraints on /x are all linear equalities and inequalities. Moreover, 
such a solution can be found efficiently, as was shown by Karp and Papadimitriou |KP82| and in 
more general settings by Koller and Megiddo |KM94j . Alon and Lovett |AL11| give a strongly 
explicit analog of this in the case of t-wise permutations and more generally in the case of group 
actions. 

A different relaxation is to require the uniform distribution over T to only approximately sat- 
isfy equation ([2j). Then it is trivial that a sufficiently large random subset T G B satisfies the 
requirement with high probability, and the question is to find an explicit solution. For instance, we 
can relax the problem of t-wise permutations to almost t-wise permutations. For this variant an 
optimal solution (up to polynomial factors) was achieved by Kaplan, Naor and Reingold [KNROSj . 
who gave a construction of such an almost t-wise permutation of size n^'^^\ 

A framework for studying problems similar to ours in a continuous setup was introduced by 
Seymour and Zaslavsky [SZ84] . They consider the case when i? is a path-connected topological 
space with a measure /i of full support, and the problem of finding a finite subset T C B such 
that the uniform distribution over T integrates certain continuous functions exactly as /i. This 
framework is sometimes referred to as "averaging sets" , "equal- weight quadratures" or "Chebyshev- 
type quadratures". It was shown in |SZ84] that such T exist in great generality and that their 
cardinality can be any number with finitely many exceptions. However, no quantitative upper 
bound on \T\ is known in this generality. 

1.6 Paper organization 

We give a precise description of the general framework and our main theorem in Section [2j We 
apply it to show the existence and estimate the number of orthogonal arrays, t-designs and t-wise 
permutations in Section [3l The proof of our main theorem is given in Section [H We summarize 
and give some open problems in Section [5l 
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2 General framework 

Let B he a finite set and let y be a linear subspace of functions / : — )• Q. The goal of this work 
is to find sufficient conditions for the existence of a small set T C B such that 

Tf-\T.f(*^ = JW\T.f(^^ for all/ in l^, (9) 

' ' teT ' ' beB 

and moreover to estimate the number of such sets of a given size. We now describe a list of 
conditions on V which will be sufficient for this task. Some of these conditions are easy to verify 
in applications, while others require some effort. We stress that in all of our applications these 
properties are verified explicitly; this is in contrast with the fact that we do not know how to find 
T explicitly. 

Divisibility. For ([9]) to hold for a set T with |r| = N we must have 

E^(^) = M^-^^^^ for ah/ in y. 

teT ' ' beB 

In particular, we must have that 

there exists a 7 E Z^^ such that ^ 7^/(6) = X] •^'^^^ ^ ^^^^ 

beB ' ' beB 

The set of all integers satisfying pup consists of all integer multiples of some minimal positive 
integer ci. To see this, observe that if A^i and are solutions then their difference is also a 
solution. It follows that |r| must be an integer multiple of ci. This is the divisibility condition 
that we require, and we call this ci the divisibility constant of V. 

We remark that to check the divisibility condition in practice it suffices to check (llOp for / 
belonging to some basis of V. More explicitly, we make the following definition. 

Definition 2.1. Let (j) : B ^ Q"^ for some finite set A. We define C{(j)) to be the lattice spanned 
by 0(6), beB. I.e., 

IbeB ) 

Using this definition, ii (j) : B ^ Q"^ is such that the vectors {<j)a)) a, £ A, form a basis for V , 
then the divisibility condition is equivalent to having |^ YlbeB ^(b) S -^^(0)- 

Boundedness. We make the following definition. 

Definition 2.2. Let W C be a vector space. For 1 < p < 00, we say that W has a c-bounded 
integer basis in ip if W is spanned by integer vectors whose ip norm is at most c. That is, if 

Span({7 €WnZ^ : \\-f\\p < c}) = W. 

We will only use in this paper the norms ||7||i = X^^eB and ||7||oo = maxbgs {jhl- We denote 
by V-^ the orthogonal complement of V in , that is, 

:= {<7 G : 5] f{b)g{b) = V/ G V}. 
beB 
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We impose the conditions that for some small C2 and C3, V has a C2 -bounded integer basis in 
and V-^ has a cs-bounded integer basis in £1. 

In our applications, the boundedness condition for V follows directly from the definition. How- 
ever, the boundedness condition for V'^ is less trivial and requires far more work to check. We 
view this condition as an analog of the LDPC (Low Density Parity Check) condition in coding 
theory, when viewed over the integers, and we develop techniques based on coding theory in order 
to guarantee it. In particular, we show in Section 13.11 that the condition is implied by a certain 
local decodability property of V. 

Symmetry. The next condition relates to the symmetries of the subspace V. 

Definition 2.3. A symmetry of ^ is a permutation vr G Sb satisfying f o n £ V for all f gV . 

Equivalently, if (p : B ^ Q"^ is such that the vectors {4>a), a G A, form a basis for V then 
a permutation vr € Sb is a symmetry of V if and only if there exists an invertible linear map 
T : such that 

(j){7r{b)) = T{(l){h)) for ah 6 G 

It is straightforward to check that the set of symmetries of V forms a subgroup of Sb- We impose 
the condition that this group acts transitively on B. That is, that for any 61,62 G B there exists 
a symmetry vr of y satisfying vr(6i) = 62. In our applications this condition follows from the 
symmetric nature of their description. 

Constant functions. Our last condition is required for somewhat technical reasons as explained 
in the proof overview section. We require the constant functions to belong to V . We note that this 
is the case in all of our applications. 
We can now state our main theorem. 

Theorem 2.4 (Main Theorem). There exists a constant C > such that the following is true. Let 
B he a finite set and let V he a linear suhspace of functions / : i? — t- Q. Assume that the following 
conditions hold for some integers ci, C2, C3 > 1, 

1. Divisihility: ci is the divisihility constant ofV. 

2. Boundedness ofV: V has a C2-hounded integer hasis in ic>o- 

3. Boundedness ofV^: V-^ has a c^-hounded integer hasis in £1. 

4- Symmetry: for any 61,62 G B there exists a symmetry nofV satisfying vr(6i) = 62. 
5. Constant functions: The constant functions helong to V . 

If 

N is an integer multiple of ci satisfying min(A^, \B\ — N) > C ■ C2c| dim(y)^ log(2c3 dim(y))^ 

(11) 

then there exists a suhset T C B of size \T\ = N satisfying 
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A second goal of our work is to count the number of subsets T of a given size which satisfy 
(I12p . To this end, we define a parameter p{V) of the vector space V as follows. It is easiest to 
define p{V) via a choice of basis for V but we stress that its value is independent of this choice. If 
(j) : B Q"^ is such that the vectors {(j)a), a G A, form a basis for V, we define 

where in the numerator we mean the determinant of the lattice C{4>) (i.e., the volume in the 
appropriate dimension of a fundamental parallelogram of the lattice generated by {(j){b)}, b £ B) 
and in the denominator, the root of the determinant of the A x A matrix whose a, a' entry is 
(/)(6)a(/>(fo)a' (we denote by the transpose of The definition takes a more symmetric 
form upon noting that the denominator is the determinant of the lattice in generated by {(?i>a}, 
ae A. 

Theorem 2.5. There exists a constant C > such that the following is true. Let B be a finite set 
and let V be a linear sub space of functions f : B ^ <Q. Assume that the conditions of Theorem \2.4\ 
are satisfied with constants ci,C2,C3 > 1 and that N satisfies pT|) . Then the number of subsets 
T C B of size N which satisfy (112p equals 

(27rp(l -p))^^p^(l 

where v- ^ and \S\ < gdim(v-)3(iog(2c2 dim(y)))3/^ 
wtiere p .- ana \o\ s ^^in{N,\B\-N) 

As explained before, the divisibility requirement in (jlip is a necessary condition for the existence 
of a subset T C B of size N satisfying (jl2p . Our theorem says that when is not too close to or 
\B\ this condition is also sufficient, and gives a rather precise count of the number of such subsets. 

Finally, we remark that our techniques yield a bit more. One can use them to show the existence, 
and estimate the number, of subsets T of a given size on which the average of functions f G V 
has a specified (small) displacement from the average over all of B. This extension is described in 
Sections O and 



3 Applications 

In this section we apply our main theorem, Theorem l2.4( to prove the existence results for orthogonal 
arrays and t-designs, Theorems 11.11 II. 2|, 11.31 and 11.61 It will be useful to introduce the following 
notation. For a map (j) : B ^ Q"^ and a vector 7 G we let 

0(7) = 5]7fe0(^)eQ^- 

beB 

We also define 

UWoo ■■= ^ max \(p{b)a\. 

We start by describing a certain condition which implies the boundedness condition for V-^ and 
which will be useful in our applications to orthogonal arrays and t-designs. 
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3.1 Local decodability 

In all of our applications it turns out that the most difficult condition to verify is that V-^ has a 
bounded integer basis in ii. This condition can be seen as an analog of the Low Density Parity 
Check (LDPC) notion coming from coding theory. We next introduce another condition which 
implies that V-^ has a bounded integer basis in ii, but which is sometimes easier to verify in 
practice. This condition is motivated by the notion of locally decodable codes in coding theory. 
Local decodability of codes is mainly studied in the context of codes defined over finite fields, see, 
e.g., [Yekllj . Here, we study an analog of these definitions for codes defined over the rationals. 

Formally, we require that for some bounded integer basis {(pa), a G A, ol V, we may express a 
small multiple of the unit vectors (in the basis given by A) by short integer combinations of (pib). 
The condition is also related to the notion of bi-orthogonal system in functional analysis. 

Definition 3.1 (Local decodability). A map 4> : B ^ is locally decodable with bound c if there 
exists an integer m > 1 with \m\ < c and a set of vectors (7") C Z^, a G A, satisfying ||7'^||i < c 
and 

</.(7«) = m-e" (aeA), 
where G {0, 1}"^ is the unit vector with 1 in coordinate a. 

Claim 3.2. If V has a basis of integer vectors {(pa) C "L^ , a ^ A, such that \\(p\\oo < C2 o-nd (p is 
locally decodable with bound C4 then V-^ has a c^-bounded integer basis in £1 with C3 < 2c2C4\A\. 

Proof. Let m and (7^*), a G ^, be as in Definition 13.11 for (p. Define the vectors (6^) C Z^, b € B hy 

■.= m-u' -Y,(P{b)a-l\ 

where G {0, 1}^ is the unit vector with 1 in coordinate b. We claim that the set {5^ : b G B} 
linearly spans V-^. First, note that 6'' G V-^ for all b e B since 

(Pis') = m ■ 0(n^) - Hb)a ■ 0(7") = m ■ cp{b) - ^ cp{b)a ■ (m • e^) = 0. 

We next argue that the rank of {iJf, : G -B} is at least \B\ — \A\, and hence they must span V-^. 
To see this, let ^ : — )• QI-^I~I^I be an arbitrary surjective linear map which sends {7" : a G A} 
to zero. Then {61, : b G B} are mapped to a basis of QI^I^I^^I by ^ and hence their dimension is at 
least \B\ — \A\ (in other words, the {6^} are a linear perturbation of the B x B identity matrix by 
a matrix whose rank is at most |^|). The bound on C3 follows since 

\Mi<m + Y,\(Pib)a\hli<C4 + \A\c2C4<2c2C4\A\ (beB). □ 

3.2 Orthogonal arrays 

We prove Theorems 11.11 and 11.21 in this subsection. We recall the relevant definitions from the 
introduction. A subset T C [(7]" is an orthogonal array of alphabet size q, length n and strength 
t if it yields all strings of length t with equal frequency when restricted to any t coordinates. 
In other words, for any distinct indices ii,...,it G [n] and any (not necessarily distinct) values 
vi,...,vte [q], 

\{x £T : Xi^ = vi, . . . ,Xi^ = vt}\ = q~^\T\. 
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Orthogonal arrays fit into our general framework as follows. We take B := [g]" and V to be the 
space spanned by all functions of the form 



f{I,v){xi, - ■ ■ ,Xn) 



1 Xi = Vi for alH G / 
Otherwise 



with / C [n] a subset of size t and v G [qY . With this choice, a subset T C B satisfying ([T]) is 
precisely an orthogonal array of alphabet size q, length n and strength t. 

We shall now verify the conditions of Theorem 12. 41 for V. We note that the sum of all the above 
f[l^v) is a constant function, thus verifying the constant functions condition. We continue with the 
symmetry condition. Fix x S [g]" and consider the permutation tTx & Sb given by 7rx{b) = b + x 
(mod q), where we apply the modulo q coordinate-wise, and with the convention that it maps Z 
to [q]. We will show that each tTx is a symmetry of V, which will establish the condition since the 
group {tTx : x G [q]""} acts transitively on B. It suffices to show that for {I,v) of the above type, 
we have /(/,^) o ttx £ V. Indeed, 

(/(/,!,) o -^x){b) = f{i,v){b + X (mod q)) = f(iy){b) G V, 

where v'^ = Vi — Xi (mod q) for i £ I. 

To verify the remaining conditions in Theorem 12.41 we choose a convenient basis for V. The 
above set of functions {//,,;} is linearly dependent in general. Let 

A:={{I,v) : \I\<t,ve[q-lY}, 

and set (pa = f(i,v) for ^ = (-^j v) £ A. Here, by /(0,0) we mean the constant one function. We will 
show that the (</>a), a £ A, form a basis for V, that the lattice C{(f)) which they generate equals 
and that they are locally decodable. The remaining conditions of Theorem 12.41 will follow easily 
from these properties. 

Claim 3.3. The span of the functions {(l)a}aeA is V . 

Proof. Clearly (pa ^ V for all a £ A. To see that the {0a}aeA also span V , it suffices to show 
that any f[i^v) with |/| < t and v G [qY is in the span of {(pa}a&A- We do this by induction on 
the number of elements in v which are equal to q. Let (/, f) have |/| < t and v G [qY ■ First, if 
V £ [q — lY then (/, w) G A by definition. Now suppose v G [qY \ — 1]^ and let io G I be such 
that = q. For j G [g — 1] define by = Vi, i £ I \ {io}, and vj^ = j. Define v' to be the 
restriction of f to / \ {io}. Then 

9-1 

f{i,v) = f{i\{iQ},v') - ^f{i,vo) 
i=i 

and, by induction, the right hand side belongs to the linear span of {(pa}a&A- D 

Claim 3.4. The map (p is locally decodable with bound 2^ and tti — 1. Consequently ^ the {^(pa)^ 
a £ A, form a basis for V and C{(p) = "L^. 

Proof. For J C / C [n] and v G [qY , we write v\j for the restriction of v to J. Define a partial 
order on A by letting a' < a, for a = {I,v) and a' = {I',v'), if I' C / and v' = v\j'. For each 
a = (I, v) G A, define an element 6" G by 
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The definition of (f) implies that 

(t>a'{b'') = \a'<a) (.a,a'eA). 
Define for each a = {I,v) ^ A the vector 7" G by 

((-1)\IHJ\ b = b^-^''"^^ for some J C I 



lb 



otherwise 



We have ||7"||i = 2^^^ < 2*. We will show that (pi^"") = e" for each a G A, thereby establishing the 
local decodability claim. By the above, if a = {I,v) and a' = {I',v') then 

JCI JCI 

nil - 

I'CJCI j=o \ J / 

as we wanted to show. Local decodability implies that the (</>a), a £ A, have full rank, and together 
with Claim [3131 we deduce that they form a basis for V. Additionally, the fact that ^(7") = e" for 
every a G A implies that = Z"^. □ 

Claim 3.5. The divisibility constant ofV equals g*. 

Proof. It suffices to show that is the smallest positive integer N for which 

^ E '^(^) e m 

' ' beB 

Indeed, since C{4>) = by Claim [3^ this follows by noting that 

E '^(^)a.-) = 4rli^ e [^]" : X, = t;, Vi G /}| = ^-l^l. □ 

' ' beB ^ 

We are now in place to apply Theorem 12.41 The divisibility constant of ^ is ci = . The {(l)a}^ 
o G ^, are a 1-bounded integer basis for V giving C2 = 1. We have 

dim(V) H^l =E (';)(«-!)• <(';).' <(!f)' (14) 

with the next to last inequality following since V was defined as the span of /(/,^) with |I| = f 
and V G [qY . Since local decodability holds with bound C4 = 2*, we deduce from Claim [3^2] that 
y-*" has a C3-bounded integer basis in £1 with C3 < 2c2Ci\A\ < 2(^^^)*. Hence we establish the 
existence of an orthogonal array of alphabet size q, length n, strength t and size \T\ 

< (£2!!)^* for 

some universal constant c > 0. 

Lastly, we aim to use Theorem l2.5l to count the number of orthogonal arrays of a given size. To 
this end we need only calculate p{V). Observing that for our choice of cj) we have det(£(0)) = 1 by 
Claim 13.41 we may apply formula (fT3l) (with our choice of (f)) to obtain 

^^^^ _ det(£(<A)) _ 1 
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Claim 3.6. det(0V) = ')(''"^)* 
It follows from the claim that 



P{V) 



-n(";^)(,-l)V2 



Together with the calculation of dim(y) in (|14|) . Theorem 11.21 now follows from Theorem YF. 

Proof of Claim It will be useful to let n and t vary in the proof of the claim. Hence we shall 
write, for n > 1 and < t < n, A{n, t) for A and (j){n, t) for Denote t) := 4>{n, tY(p{n, t) 
(where 0(n, t)* denotes the transpose of (j){n, t)). Then R{n, t) is an A{n, t)xA{n, t) matrix satisfying 



^("-i *)(/,!,), (7',t,') = ■■■ ,Xn) G [g]" : Xi = ViWiG I and x,/ = Vi>\/i £ l'}\. 



We also let 



i=o 



Define an,t '■= logo(det(i?(n, t))) so that the claim states 



On,* 



n 



n 



1 



(15) 



(16) 



(17) 



We first establish this fact when t = or n = t. Indeed, if t = we have A{n,t) = {(0,0)} 
and ^(?T-, t)(0,0),(0,0) = q^, proving that Onfl = n. If n = t, |^(n, i)| = and hence (j){n,t) is a 
square matrix with the property that the lattice spanned by its rows, by Claim Eai is Thus 
det{(j){n,t)) = 1 and hence det(i?(n,i)) = 1 = q^, verifying (fT7|) in this case as well. 
In the rest of the proof we will show that for any n > t > we have 



0'n,t 



in-l,t + dn-l,t + {q — l)an-l,t-l — dn-l,t-l- 



(18) 



Noting that dn-i,t — dn-i,t-i = ("7 )(? ~ ^y (fT6|) . the claim follows upon verifying that the 
(an^t) defined by (fTTl) satisfy this recursion. 

To prove (jlSp . fix n > t > and partition A{n,t) as follows: 

A{n,tf := {{I,v) G A{n,t) : n ^ I}, 

A{n,ty := {{I,v) £ A{n,t) : n G / and v„ = j}, j £ [q - 1]. 

Observe that A{n — 1, t — 1) C A{n — l,t) = A{n, t)°. Denote by R{n, tf'^ the sub-matrix of R{n, t) 
whose rows are indexed by A{n,ty and whose columns are indexed by A{n,ty . By (fTSll we have 

'qR{n-l,t) i = j = 
R{n,ty'^=lo ij e[q-i]^i^j , (19) 

^R{n-l,t-l) i,j £[q-l],i=j 

where in the second case we mean the matrix, and in the third case we identified a = {I, v) G 
A{n,ty with a' = {I' ,v') G A{n - l,t - 1) defined by letting I' = I \ {n} and = Vi for i G /'. 
Summarizing these equalities we have 

/ qR{n-l,t) R{n,t)°'^ 
i?(n,t)i'0 R' 
R{n,t)= R{n,ty'' 



i?(n,i)0'2 


R' 



\ R{n,ty-^'° 







R{n,tf'i-^ \ 





R' 



J 
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where the i,j cell in the displayed matrix corresponds to the sub-matrix indexed by A{n,ty and 
A(n,ty , and where we abbreviated R' := R{n — l,t — 1) for display purposes. In addition, ()15p 
implies that if ai € A[n, ty^ and 02 G A{n, ty^ then 



R(nt) J1 = 0,J2^0 



(20) 



Thus, if we subtract from each of the columns indexed by 02 G A{n, t) \ A{n, t)^ the corresponding 
column indexed by G ^(?^, i)" times | we obtain the matrix i?(n, t) satisfying 



R{n,t) 



( qR{n-l,t) 
i?(n,t)2.o 



(1 - 



y R{n,t 



,9-1,0 



Denoting by / the identity matrix and by 1 the square matrix containing all ones, both of dimension 
q — 1, it follows that 



det(i?(n, t)) = det(i?(n, t)) = det{qR{n - 1, t)) det I - -1 ® R{n -l,t-l 



ja.n-i,t+dn-i,t + {q-i)an-i,t-i (^g^ i I — — 1 



dn- 



^1n-l,t+(in-l,t + (g— l)an-l,t-l— (in-l,t- 



where we used that if yl is an m x m matrix and B is a k x k matrix then det (A B) = 
det(A)'^ det(i3)™', and where we calculated det(/ — ^l) = ^ since the only non-zero eigenvalue 
of 1 is (7 — 1. This establishes the recursion (jlSp and finishes the proof of the claim. □ 



3.3 Designs 

We prove Theorems 11.31 and 11.41 in this subsection. We recall the relevant definitions from the 
introduction. A (simple) t-{v, k, A) design, or t-design for short, is a family of distinct subsets of 
[v], where each set is of size k, such that each t elements belong to exactly A sets. In other words, 
denoting by [^] the family of all subsets of [v] of size A:, a set T C [^] is a t-design if for any distinct 
elements ii, . . . ,it G [v], 

\{s€T:h,...,it€s}\=^\T\=X. (21) 

Our general framework includes t-designs as follows. We take B to be [^] and V to be the space 
spanned by all functions of the form 



fa{b) 



1 a Cb 

Otherwise 



with a G [^] . With this choice, a subset T C B satisfying ([T|) is precisely a simple t-{v, k, A) design. 
We choose A = [^] , set 0a = fa and define a map 4> : B —?• by (l){b)a = 4>a{b)- 

Fix f > 1 and 1 < t < k < v. We assume without loss of generality that k < v — t since 
if k > V — t we have \B\ = (^) < (^) and hence Theorem 11.31 holds trivially and Theorem 11.41 
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holds vacuously (by taking the complete design in Theorem ll.3| and by noting that necessarily 
(^) — iV < (^) in Theorem ll.4p . This assumption will be needed shortly to show that {4>a}, a ^ A, 
form a basis for V (that is, that they are linearly independent). 

We shall now verify the conditions of Theorem 12.41 for V. First, the boundedness condition for 
V trivially holds with constant C2 = 1 by our choice of (p. Second, we observe that X^^g^ (pa is 
the vector with all coordinates equal to (^) . Hence the constant functions assumption is satisfied. 
Third, to establish the symmetry condition let vr G S'[„] be a permutation on [v]. vr acts in a natural 
way on B (by permuting fc-sets) and on A (by permuting t-sets). We have that 

{(pa o vr)(6) = (paiTT{b)) = laC7r(fe) = l7r-l(a)cfe = '/'tt-i (a) (^) , 

and in particular (/^a o vr G ^ for all a £ A. The action of Sj^j on B is transitive, from which 
the symmetry condition follows. We continue to find the divisibility constant of V. We need the 
following result of Wilson jWil73j and Graver and Jurkat jGJ73j . 

Theorem 3.7. The vector in Q-^ all of whose coordinates equal A belongs to C{(p) if and only if 

^ ~ A = mod Q ~ for aUO<s<t. 
Define ci > 1 to be the minimal integer such that 

ci = mod Q for ah 1 < s < t. (22) 

We claim that ci is the divisibility constant of V. Indeed, since all the coordinates of ^beB '^i^) 
equal A^(^Z^) / (^,) = A^(j)/ (j) we see that this vector belongs to C{(p) if and only if 

\t-sJ \tl ^ ^ w ^ ^ ^ for all < s < t. 



fk—s' 
Kt-s 



The case s = simply means that N €z 7^, thus a comparison with (|22|) verifies that ci is the 
divisibility constant of V. 

It is useful to have a simple upper bound for ci. Define 



lcm(t) := 1cm <; ( * ) : 0<s<t\. (23) 



Observing that 



e Z for all 1 < s < t 



we deduce that ci < (^)lcm(t). We note that |Far09j shows that log(lcm(t)) is asymptotic to t and 
mentions effective bounds for it. The next claim provides a simple self-contained proof of a weaker 
bound which suffices for our needs. It follows from the claim that ci < (^)4* < (4ef/t)*. 

Claim 3.8. lcm(t) < 4* /or t > 1. 

Proof. Assume by induction that the claim holds up to t (checking also that it holds for t = 1) and 
let us prove it for t. For each < s < [|J we have 

t\ t\ _ t\ rii! _ t! ^ 1 
si(t-s)i \^]isnt-s)i [i]!.! ii 

I 2 1+-'- 
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Since the product of every ni consecutive integers is divisible by ml (since (^) is an integer for 
every a) , using the symmetry of the binomial coefficients and the induction hypothesis we have 



1cm 



0< s <t 



}) 



1cm 



< 1cm 




< 



< 2*4l-5J < 4* 



□ 



Thus, to verify the conditions of Theorem 12.41 it remains only to verify the boundedness as- 
sumption for V-^. We do so by applying the local decodability claim, Claim [3121 to the map 4>- Let 
a £ A = [^] . Let u G [^^J be a any set of size k + t such that a C u (here we use our assumption 
that k < V — t). We denote by [^] C B the family of subsets of u of size k. Define 7a, « G as 



«=o fee[;;]:|anfe|= 



■1) 



t-s 



S! 



1)1 



(k-t-iy. 



Ub, 



where Uh G {0, 1}^ is the unit vector with 1 in coordinate b. 
Claim 3.9. (/>(7a,«) = j^rty. ' for all a £ A. 

Note that the claim implies, in particular, that the {(f)a), a £ A, are independent and thus 



dim{V) = \A\ = 

The next technical claim is used in the proof of Claim 
Claim 3.10. Let a > b > andc> 0. Then 



(24) 



We set f") = whenever n < 

\m/ 



m. 



i=0 



c + i 
b 



0. 



Proof. Let f{a,b,c) = EUi-^Y il) i"V) ■ b,c> we have C=+^) = (%i+^) + (^-^+^) and 
hence f{a,b,c) = f{a,b,c — 1) + f{a,b — l,c — 1). So, it is enough to verify the claim whenever 
6 = or c = 0. If 6 = then /(a,0,c) = Ej"=o(-l)* (i) = since a > I. If c = then 

/(a,6,o) = E-=6(-ir(:)a) = {t)EUi-^y{tl) = o- □ 

Proof of Claim [XPl It is clear from the definition that (p{^a,u)a' = if a' ^ u. So, we restrict our 
attention to a' C u. For a' = a the contribution is only from sets with s = \a Ci b\ = t, of which 
there are {fjLt) j hence 

A; \ . k\ 



4>{'la,u)a 



k - t 



tl 



{k-t)l 

We now need to verify that 4>{'^a,u)a' = for all a' d u^a' ^ a. Let us denote ^ = |a R a'\ where 
< £ < t. The contribution to 4'{'^a,u)a' comes only from sets b for which a' C b. The number of 
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these sets with |an 6| = s is {k^t^^s+i) = (t-DC" moreover, that s > We have 



(fc-t - 1)! 



(t-t- l)!(fc - i + £)! (^-i^fl,-s-l-\^ 



s=l 

t 



{k-t-i)\ 

Recalhng that t — ^ > we now apply Claim 13.101 with a = t — = t — I — l,c = k — t — 1 and 
conclude that (l){'^a,u)a> =0. □ 

In order to obtain tight bounds, we will divide 7a, « by a factor common to all the coefficients 
appearing in it. Note that 



s\{k-s-l)\ fk-s-l\/t"- ^ 



{k-t-l)\ \k-t-l 

and hence 



t\. 



7:,„:=i!^-7„€Z« (25) 



We continue to show that is locally decodable. We have 



b G [I] for which |a n 5| = . is (*) = (*) (^); and flgEf^ = C=7 VCl')- Hence 



and we recall that (^)lcm(t) < (^)4* by Claim \3M To bound ||7a„||i observe that the number of 

s\(k-s-iy. 

ti{k-t-iy. 



ITa.M II 1 



implying that (p is locally decodable with bound C4 = 8*(^) < (8e • k/ty. Finally, since \ A\ = (^) < 
{ev/ty we obtain from Claim [3^ that V-^ has a cs-bounded integer basis in ii with C3 < 2c2Ci\A\ < 
{Aev/tfK 

We have verified the conditions of Theorem 12.41 with \ A\ < {ev/tY,ci < (4ef/t)*,C2 = 1,03 < 
(4et;/t)^* and thus we establish the existence of a simple t-{v,k,X) design of size \T\ < {cv/tY^ for 
some universal constant c > 0, proving Theorem 11.31 

We turn to estimate the number of designs of a given size via Theorem 12.51 To this end we 
consider (j) as a, B x A matrix and need to calculate the parameter piV) from ()13p . We rely on 
a theorem of Wilson |Wil90] giving a diagonal form of and on a theorem of Bapat |BapOO| 
calculating the eigenvalues of 

Theorem 3.11. f WiW(A Theorem 2] There exist a (^) x (^) matrix E and a (^,) x matrix F , 
both with integer entries and satisfying \ det(ii^)| = | det(F)| = 1, such that M := Ecp^F has Mij = 
if i ^ j and has diagonal entries (^Zg) with multiplicity (^) — (3^]^) for < s < t (with (^^^ := 0). 
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Theorem 3.12. jBapOO ^ Theorem 4] The eigenvalues of cp^cp are (^_*) C^lj**) with multiplicity 
- (/i) forO<s<t (with := 0). 



The two theorems immediately imply that 
det(£(0)) 



P{V) 



nt (k-s\{"s)- 
s=0 {t-sj 



nUo (t:)c 



k—s\ /v—t—i 
k-t 



n 

s=0 



/v—t—s\ 
\ k-t ) 



Theorem 11.41 now follows by an application of Theorem 12.51 (recalling that dim(y ) = by 

We remark briefly on a possible approach to proving Theorems 13.111 and 13.121 via representation 
theory (see Section r3.4.1l for some background), though we neither require nor develop this approach 
here. One may naturally identify the set [^] with tabloids of shape {v — k, k) as the numbers 
appearing in the second row of the tabloid. Thus, may be identified with the Young module 
U(v-kk)- Similarly, M"^ may be identified with Ui^_tt\. For any m, the decomposition of Ui^ 



' {v~-m,m) 



into irreducible representations is C/(i,-m,,m) = ffi^o^(f-s,s)- Since 4> intertwines the action of Sn on 



{v—k,k) 



and U(^^_t,t)i Schur's lemma implies that (p and 



representations for ?7(^_/c^fc) and U(^y_t^t)i and act as scalars from V(j, 



* are diagonal in the basis of irreducible 
s) to itself, < s < t. 



Finally, for each s, the scalars appearing in these actions can be determined by considering the 
action on some particular vector in V^y_g^gy Choosing bases appropriately one obtains that the 
scalar for the (p action is 



)-t-. 

k-t 



) and the scalar for the action is both with multiplicity 



3.4 t-wise permutations 

We prove Theorem 11.61 in this subsection. We recall the relevant definitions from the introduction. 
A family of permutations T C Sn is called a t-wise permutation if its action on any i-tuple of 
elements is uniform. In other words, for any distinct elements ii,. . . ,it G [n] and distinct elements 
31, ---Jt e [n]. 



|{7r G T : 7r(ii) = ji, . . . ,7r (it) = jt}\ 



1 



\T\. 



n{n — 1) • • • (n — t + 1) 

Our general framework includes t-wise permutations as follows. We first set notations. Let 
[n]t := {(ii, . . . , ij) : zi, . . . , G [n] distinct} denote the family of t-tuples of distinct elements. For 
/ = (ii, ■ ■ ■ ,it) G [n]t and n £ Sn define 7r(J) := (7r(zi), . . . ,7r(zf)) G [n]t. We take B = Sn and W 
to be the space spanned by all functions of the form 



7r(7)=J- 



(26) 



where [n]t (we changed notation for the subspace from 1/ to in this section to avoid con- 

fusion with notations arising later which are related to the representation theory of the symmetric 
group). With this choice, a subset T <Z B satisfying ([T]) is precisely a t-wise permutation. We now 
establish the conditions of Theorem 12.41 We will show that: 



1 . The divisibility constant of W \s ci = ^^"'^■j, . 

2. W has a C2-bounded integer basis in with C2 = 1. 

3. W-^ has a cs-bounded integer basis in £i with C3 = (t + 2)!. 
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4. The group Sn acts on W transitively. 

5. The space W contains the constant functions. 

6. The dimension of W equals the number of permutations in Sn with longest increasing subse- 
quence of length at least n — t. It satisfies dim(VF) < < n^*. 

With these conditions, Theorem 12.41 immediately implies Theorem 11.61 

A few conditions are easy to verify. First, W contains the constant functions since for each 
I £ ["-Iti Z]je[n]t fi,J constant function 1. Second, any spanning subset of the functions fi^j 

is an integer basis for W with norm C2 = 1. Third, observe that Sn acts naturally on B = Sn 
by composition. Each o" E is a symmetry of W since for any [n]t, 

{fl,J ° o-)(vr) = //,j(o-(7r)) = = l^(7)=^-i(j) = //,a-i(j)(7r) G W. 

The action of Sn on B is transitive, from which the symmetry condition follows. Fourth, dim(14/^) < 
|[n]tp since the span W. 

In order to find the divisibility constant of W and establish that W'^ is spanned by integer 
vectors with small ii norm, we will need some basic facts regarding the irreducible representations 
of the symmetric group, which we describe next. We will follow the notation of [FHOlj Chapter 4], 
but also refer the reader to |Jam78) for the necessary background. 

3.4.1 Irreducible representations of the symmetric group 

Partitions. A partition A of n is a vector (Ai, . . . , A^) of some length i such that Ai > • • • > A^ > 1 
and X]i=i ~ referring to Aj for i > £ we adapt the convention that Aj = for such i. The 
conjugate partition A' is defined as (A'^, . . . , A'^) where m = Ai and A'^ := \{j : Aj > The 
dominance partial order > on partitions is defined by 

j j 

A ^ /i if and only if Aj > fii for all j > 1. 

i=l i=l 

We let > stand for the lexicographic total order on partitions. It is well-known that the lexicographic 
order extends the dominance order, and that A > if and only if /x' > A'. We denote by Vn the set 
of all partitions of n. 

Young diagrams, tableaux and tabloids. Let A G Vn- Associated with it is the Young 
diagram of shape A (in English notation). A tableau of shape A is a filling of the Young diagram 
of shape A with the integers 1 to n, with each integer occuring once. We say that two tableaux 
are row- equivalent if they are the same up to the order of the numbers in each row. A tabloid 
of shape A is an equivalence class of tableaux for the row-equivalence relation. We denote the 
tabloid associated to the tableau T by [T] . Sn acts on tableaux with the permutation action on the 
numbers in each tableau. This induces a corresponding action on tabloids. The column stabilizer 
of a tableau T of shape A is the subgroup Qt < Sn of permutations preserving the columns of T. 

Irreducible representations, Young modules and Kostka numbers. The irreducible rep- 
resentations (over C or Q) of the symmetric group Sn are in correspondence with shapes A S Vn- 
We denote by V\ the irreducible representation corresponding to A. We let U^, sometimes called 
the Young module or permutation module of shape ^, be the module whose basis is all tabloids of 
shape /i, equipped with the action of Sn on tabloids. Each Young module is isomorphic to a 
sum of irreducible representations. The Kostka number K\^^ is the multiplicity of the irreducible 
representation V\ in f/^. It is known that i^A,^ = unless X> fi and Ka,a = 1- 
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The group algebra and Fourier analysis. We denote by C5„ the group algebra of Sn, the set 

of functions / : — t- C endowed with the product 

(/*ff)(vr)= Yl /(^McT-V). 

We fix once and for all a matrix representation for each irreducible representation V\. Then the 
functions Vx{-)ij : — )• C for A G 7^^ and 1 < i,j < dim(yx) are linearly independent and span 
CSn- Moreover, extending Va(-) linearly to all of CSn, 

Vxif * g) = Vx{f)Vx{g) for /, 5 G C5„ and A G P„. 

In addition, / = if and only if Vx{f) = for all \ £ Vn- 

3.4.2 Two bases for W and the divisibility constant 

Define W to be the span of the functions {Vx{-)i.j ■ Xi > n — t,l < i,j < dim(yx)}- As all the 
s-i'^ linearly independent, we have 

dim(VF') = dim(yx)^. (27) 

XeV„:Xi>n~t 

In this section we show that W = W and define a combinatorial basis which is useful in determining 
the divisibility constant of W . 

Claim 3.13. W C W' . 

Proof. It suffices to show that fj^j G W for all I, J £ [n]t. Consider the representation Rt of the 
action of 5^ on t-tuples, defined by 

Rt{'^)l,J = K{I)=J = fl,j{'^) for vr G S'n and /, J G [n]t. 

It is simple to see that Rt is isomorphic to the Young module for /i = (n — t, 1, 1, . . . , 1) G Pn- 
Indeed, this follows by identifying each / G [n\t with the tabloid of shape ^ having the elements 
of /, in order, as the elements in its first column at rows 2, . . . , f + 1. By definition of the Kostka 
number -f^A,^, if the irreducible representation Vx appears in the decomposition of [/^ then A > ^, 
which occurs if and only if Ai > n — t. In conclusion, the decomposition of Rt into irreducible 
representations contains only Va with Ai > n — t. Passing to the basis of irreducible representations, 
we conclude that fi^j G W for each I, J £ [n]t, as required. □ 

It is not difficult to use the same representation-theoretic methods to show that W = W' , but 
we proceed by a different route in order to identify also the divisibility constant of W. 

For a G Sn, let LIS((t) denote the length of the longest increasing subsequence of a. Define 

A:={a £ Sn ■■ US{a)>n-t}. (28) 

Claim 3.14. |^| = dim(VF')- 

Proof. The Robinson-Schensted correspondence |Rob38[ ISch61| shows that 

\{a £ Sn : LIS(^7) = r}\ = ^ dim{Vxf. 

AeP„:Ai=r 

Summing over n — t < r < n and comparing with (127p concludes the proof. □ 
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We now define a set of functions (fa), cr ^ A, forming a basis of W. For each a £ A, let 
S{a) C [n] be the indices of an (arbitrary) increasing subsequence in a of length LIS(cj). That is, 
S{a) = (ii, . . . ,?Lis(o-)) some indices satisfying ij+i > ij and a{ij^i) > cr{ij). Define 

I otherwise 

It is clear that the functions are in W since |5'((t)| > n — t. Let ^ stand for the lexicographic 
order on 5„. I.e., vr ;^ o" if there exists a j such that 7r(i) = a{i) for all i < j and 7r(j) > o"(j). 

Lemma 3.15. For each a G A, /o-(o") = 1 a?^c? ^//^(vr) = 1 i/ien vr ^ o". 

Proof. fa{o-) = 1 by the definition of fa-- Suppose that /o-(vr) = 1. Then 7r{i) = a{i) for every 
i ^ S{a) and {vr(i) : i G 5'(o")} = {(T{i) : i G ^(fT)}. Since S{a) are the indices of an increasing 
subsequence in a it follows that vr appears after a in the lexicographic order. □ 

Now let : — )• be the matrix whose columns are the f^- It follows from Lemma 13.151 
that when the rows and columns are sorted by the lexicographic order on permutations, then (j) 
is in column-echelon form. Consequently, the columns of (p are linearly independent and hence 
dim(VF) > Combining this fact with Claims [3.131 and 1 3.141 shows that W = W and that the 
columns of (f> form a basis for W. In addition, the column-echelon form of <f> implies that C{(f>) = Z"^. 
Now, if (T G j4 has LIS (a) = n — I for some < ^ < t then 

E '^(^)- = ^\U^Srr. vr(j) = aU) Vj ^ Sia)}\ = 

Hence ^ Y.b&B Hb) G = £(0) if and only if N is a multiple of ^^"'^-ji , implying that ci — ^^"'^-^1 
is the divisibility constant of W. 

We end the section by giving an alternate characterization of W. 

Claim 3.16. A function f G CS'„ satisfies f £ W if and only if V\{f) = for every X £ Vn with 
Xi <n-t-l. 

Proof. We first recall the orthogonality relations for irreducible representations which state that 
E ^^li'^~^)i,j^^MkI = '^A,At<^i,fc'^j,f A,/i G P„, 1 < iJ < dim(l/A), l<k,£< dim(y^). 

(30) 

We continue by observing that / G if and only if / G where / is defined by /(vr) = /(tt ^). 
Indeed, / G if and only if / = jg[„]j for the functions defined by ([26]) and some 

coefficients a/,j. The observation now follows since //,j(vr^^) = fjj{7r). 

Now let / G C5n. Decompose / in the basis of irreducible representations of Sn as 

dim(y,0 



where Q!^,j,j G C. Then, 



dim(yp) 

^a(/)=E E E «/^«-^M(^"')M^A(vr). 

7rG5,i /iSP,! i,j = l 
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Thus, (pOj) implies that 

We conclude that V\{f) = for all A with Ai < n — i — 1 if and only if Oifj,^ij = for all /x with 
/ii <n — t — 1. That is, if and only if / G W . Since W = W and f & W \i and only if / G we 
conclude that V\{f) = for all A with Ai < n — t — 1 if and only if / G W . □ 

3.4.3 Antisymmetrizers 

We introduce the column antisymmetrizer of a tableau T, 

br := ^ sign(cj) • a, (31) 

creQr 

which is an element of the group algebra C5„. We study in which irreducible representations 
the antisymmetrizers have a non-trivial action. We start by studying this question for the Young 
modules since these have a simpler combinatorial nature. 

Claim 3.17. Let T be a tableau of shape A G Vn- Then 

(V Uxibr) / 0. 

(ii) If U^ibx) / then A > 

Proof. We first recall the basic properties. For tabloids [T'], [T"] of shape /x and tt G 5„ we have 

^m(^)[T'],[T"] = l7r([T'])=[T"]- 

We first establish (i). We have 

U\{bT)[T],[T] = ^ sign(7r)l^([r])=['r] = 1, 

tt&Qt 

since any permutation in Qt except the identity maps [T] to a different tabloid. In particular 
Uxibr) + 0. 

We next establish [ii). To do so, we show that for every tabloid \S\ of shape [i we have bT\S\ = 0. 
Assume for a moment that there exist two elements a, 6 in the same row of S and the same column 
of T. Define Sa,b = ^(Id — (a, b)) G CSn where Id is the identity permutation and {a,b) is the 
permutation that swaps a and b. On the one hand bx * Sa,6 = bx, and on the other hand Sa,b[S] = 0. 
Hence 

bT[S] = {bT * Sa,b)[S] = brisaAS]) = 0. 

So, if U^{bT) 7^ such a pair cannot exist. Thus, all the elements in the first column of T appear 
in different rows in S; all the elements in the second column of T appear in different rows of S, etc. 
This implies that fi' > A' where fi' , X' are the conjugate partitions to fi, A. This in turn implies that 
X> H as we claimed. □ 

We now derive the analogous claim for the irreducible representations. 

Claim 3.18. Let T be a tableau of shape A G Vn- Then 

(i) VxihT) + 0. 
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(ii) If V^ibr) + then X > ^i. 

Proof. Let G "Pn be such that A ^ ^. K^^^ = 1 and hence appears in the decomposition of 
into irreducible representations. Thus, the fact that f/^(6T) = by Claim [3?T71 implies that 

V^(^t) = 0, proving {ii). 

Now, since K\ \ = 1 and Kt-^\ = unless r > A we have that U\ decomposes as the sum of 

Vx plus other irreducible representations Vr with r l> A. Since Ux^bx) 7^ by Claim [3?T71 and 

Vribx) = when r [> A by part (ii) we deduce that Va(&t) / 0, proving (i). □ 

3.4.4 Spanning vectors for W-^ 

We will prove the following lemma in this subsection. 

Lemma 3.19. Let f G CSn- Then f^WiffbT*f = for all tableaux T of shape A with 
Ai = n — t — 1. 

We first show that this gives a basis of integer vectors for W'^ of small £1 norm. 
Corollary 3.20. W'^ has a {t + 2)\-bounded integer basis in £1. 

Proof. Let T be a tableau of shape A € Vn with Ai = n — t — 1. The condition 6^ * / = is 
equivalent to 

sign(cr)/(cr"V) = for all vr E Sn- 

ct&Qt 

The £1 norm of the vectors in W-^ these define is \Qt\^ which we next derive a bound on. Let 
A' = {X'l, . . . , A'„) be the conjugate partition to A. Then \Qt\ = Hi^i Observe that for any 
a, 6 > 0, 6 + 1 = C'l^) < ("+1;+^) and hence (a + 1)!(6 + 1)! < (a + 6 + 1)!. Thus 

mm / m \ 

IQr|=n^'^! = 11(^^-1 + 1)'^ h + Y.(K-^)y = a + n-Xi)\ = {t + 2)\. □ 

1=1 i=l \ 1=1 / 

For the proof of Lemma 13.191 we need the following auxiliary claims. 

Claim 3.21. Let fj, €z Vn satisfy < n — t — 1. Then for every tableau S of shape jj, there exists 
a shape X £ Vn with Xi = n — t — 1, a tableau T of shape X and an element g G CSn such that 
bs = g* bx. 

Proof. Define an element in a tableau as maximal if it is the last element in its row and its column 
(in English notation, it is the rightmost element in its row and the bottom element in its column). 
We construct T from S by iteratively moving maximal elements which are not in the first row to 
the end of the first row, until the first row of T contains exactly n — t — 1 elements. It is simple to 
verify that this process guarantees that Qt is a subgroup of Q^, since each column of T is contained 
in a column of S. Let {cxi, . . . , be representatives for the left cosets of Qt in Qs- Then we have 

bs = i^Yl ^isn(cTi) • CTj^ * bx- □ 

Observe that if T, S are tableaux of shape A and a G Sn satisfies (t{T) = S, then bs = abTcr~^. 
Define, for X £ Vn, 

S of shape A cJdSn 
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where T is an arbitrary tableau of shape A. Denoting by Tr the trace of a matrix, the definition 
implies that 

Tr(y^(cA)) = n\ Tr(y^(6T)) for /i G P„. (33) 

Claim 3.22. V^{c\) is a multiple of the identity for all A, G Vn- Moreover, if there exists a 
tableau T of shape A such that V^(6t) 7^ then V^(ca) 7^ 0. 

Proof. Fix X, fj, £ Vn- By ()32p we have for any vr G S'„, 

Hence Schur's lemma implies that V^(ca) is a multiple of the identity. 

Now suppose that V^(6t) 7^ for some tableau T of shape A. Observe that br * bx = |Qt|&t- 
Thus |(5t|^^V"^(^t) is a projection matrix. Hence, V^(6t) 7^ implies that Tr(V^(6'jr')) 7^ 0. Thus 
Tr(V;,(cA)) 7^ by from which V;.(ca) 7^ follows. □ 

Proof of Lemma \3.19l Suppose first that f €W and let T be a tableau of shape A for some A G Vn 
satisfying Ai = n — t — 1. We will show that 6t * / = by showing that V^(5r * /) = for all 
H £ Vn- Fix fj. £ Vn- If /Ui < n — t — 1 we have V^(/) = by Claim 13.161 If /ii > n — t we have 
Vi,{bT) = by Claim [33Hi;ii). Thus in aU cases V^,{bT * f) = V^,{bT)V^,{f) = 0. 

Now suppose that / ^ W. By Claim [3.161 there exists some G Vn with ^1 < n — t — 1 such 
that V^(/) 7^ 0. Putting together Claim [3.18( i) and Claim [3.221 we have that V^(c^) is a non-zero 
multiple of the identity. Thus V^(c^ * /) = ^(c/i)V^(/) 7^ 0. By the definition (p2|) of c^, this 
implies that there exists some tableau S of shape ^ such that V^{bs * /) 7^ 0. By Claim [3?2T| there 
exists a shape A G Vn with Ai = ?7- — t — l,a tableau T of shape A and an element g G CS'n such that 
bs = 9 * bx- Since V^{g)Vf^{bT * f) = V^iibs * /) 7^ 0, we conclude that 6t * / 7^ 0, as required. □ 

3.5 The number of t-wise permutations 

One may use Theorem 12.51 to estimate the number of t-wise permutations of a given size, as we 
did for orthogonal arrays and f-designs. To this end, one needs to calculate the parameter piV) 
defined by p3|) . We leave this calculation for future work but present in this section the results of 
numerical calculations which give some evidence that p{V) has a nice product structure. 

Let B = Sn and A be as in (j28|) . Let (p be the B x A matrix whose columns are given by the 
(/cr), a £ A, defined in (j29p . As proven in Section [3.4.21 the columns of (j) form a basis for W and 
£(0) = Z^. Thus 

det(£((/>)) 1 



P{V) 



V^det((/)V) \/det( 



for this matrix (/>. Below we present the results of numerical calculations of det{(j)^(j)) for a few small 
values of n and t. 



3 • 2-^ 



n\t 


1 


2 


3 


3 


3 • 2 


1 




4 


3-218 


3 • 2^ 


1 


5 


59317219 


5932284 


5 


6 


5 • 3^2294 


5643I622276 


? 


7 


7255373382I12 


710056536272I150 


? 


8 


J . 55O3IOO2308 


? 


? 
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4 Proof of main theorems 



We prove our main theorems, Theorem 12.41 and Theorem 12.51 in this section. We start by stating 
a local central limit theorem from which our main theorems will follow. 

4.1 Local central limit theorem statement 

Let -B be a finite set and y be a vector space of functions from B to the rational numbers Q. Let 
{(pa '■ B — )• Q}aeA be a basis for V , where A is some finite index set of size dim(y). This basis is 
arbitrary for now but will be chosen in a convenient way in the next subsection. Let (p : B ^ Q"^ 
be defined as (j){b)a = (f>a(b). It may be useful to think (p as a B x A matrix, whose entries 
are (pa{b)- Fix < p < 1 and define T to be a random subset of B, with each point of B chosen 
independently into T with probability p. In other words, we let {T;,}, 6 E be a collection of 
independent identically distributed random variables with P[Tb = 1] = 1 — P[Tb = 0] = p and let 
T := {6 G B : Tfe = 1}. Define 

beB 

where C{(p) is the lattice in Q"^ generated by {(p{b)}, h £ B. Our main Theorems will follow from 
a precise estimate of the probability P[X = E[^]]. Along the way, however, we will pass through 
estimating P[X = A] for an arbitrary point A G ^{(p) (though our estimate will only be meaningful 
for A close to E[X]). Since this is a useful result in itself, which also requires less assumptions, we 
encapsulate it in the following theorem. We note that the mean of X is given by 

beB 

and the covariance matrix of X is given by 

:= E[(X - K[X])\X - E[X])] = p{l - (35) 

where (p'^'cp is the symmetric positive definite ^4x^4 matrix satisfying {(p^(p)a,a' = Ylb^B 4>{b) a(p{b) a' ■ 
The positive definite property follows from the fact that the {(pa}, a ^ A, are linearly independent. 

Theorem 4.1 (Local central limit theorem). There exists a constant C > such that the following 
is true. Assume that the following conditions hold for some integers C2,C3 > 1, 

1. Boundedness ofV: V has a C2-bounded integer basis in l^o- 

2. Boundedness ofV^: V-^ has a c^-bounded integer basis in ii. 

3. Symmetry: for any bi,b2 £ B there exists a symmetry n of V satisfying 7r(6i) = 62- 

// 

min(p|5|,(l -p)\B\) > C-C2cidim(y)^log(2c3dim(y))6 
then for every A G ^{(p), 

p[X = A] = ..^^y^ '^^^ / -i(A-E[X])*E[X]-(A-E[X]) ^ ^^^A (3g) 

(2^)^^^detS[X] ^ ^ 
with \5(X)\ < gdi"'(^)'a°g(2c2dim(y)))^/2 

I ^ ^1 - ^min{p\B\,{l~p)\B\) 
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We point out explicitly that this theorem does not require the divisibility or constant functions 
assumptions of our main theorems, Theorem 12.41 and Theorem 12.51 In Subsection 14.71 below we 
explain how our main theorems follow (easily) from this local central limit theorem and the extra 
assumptions by applying the theorem with A = E[X]. 

We also note that the local central limit theorem does not depend on our choice of basis (p in 
the sense that if it holds for one basis it holds for all bases. We make this fact more explicit in 
Subsection 14.81 where we state an equivalent basis-free version of the theorem. 

The local central limit theorem is proved in Subsections 14.21 to 14.61 below. 



4.2 Fourier analysis 

We continue with the notation of the previous section and assume the conditions of Theorem 14.11 
We fix {(pa}, a G ^ to be the basis of integer-valued functions, satisfying ||(/)a||oo < C2 for all a G A, 
whose existence is guaranteed by the boundedness condition for V. We also make the simplifying 
assumption 

1 

Near the end of the proof we will show how to get rid of this assumption by utilizing the bijection 
B\T. Finally, we denote 

N ■.= p\B\. 

We stress that N need not be an integer in our proof of the local central limit theorem. However, 
when we later deduce our main theorems from the local central limit theorem, we will choose p in 
such a way that N will be an integer. 

Our main technique to study the distribution of X is Fourier analysis. The Fourier transform 
of X is the function X : — > C defined by 

where {X,6) := '^^^j^XaOa- Define the dual lattice L to C{(p) (sometimes called the annihilator 
or reciprocal lattice of £{(/))), as the set of vectors in having an integer inner product with the 
vectors of C{(l)). That is, 

L:={e : {9, \) £ Z VA € £(0)}. 

Noting that £(</>) has full rank, since the {4>a}, a £ A, are linearly independent, it follows that L 
is also a full rank lattice and the relation det(>C((/))) det(L) = 1 holds. Since g^'^*^"'^'"^ = 1 almost 
surely when a £ L, we see that X is L-periodic, 

x{6 + a) = x{e) ye g m^, « g l. (37) 

The covariance matrix of X provides a natural norm to work with in Fourier space. Define 

R := 0V, 

so that = p{l — p)R by (j35p . As mentioned, i? is a symmetric positive definite Ax A matrix 
satisfying Ra,a' = J2beB 4>{^)a4>{^)a' ■ We define a norm in Fourier space by 



l-R 
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Balls in the i?-norm are denoted by 

BR{e) := {eeR^ : \\0\\r < e}. 

Let D be the Voronoi cell of in the lattice L, with respect to the i?-norm. That is, 

D := {9 eR^ : \\e\\R<\\e -a\\R VaGL\{0}}. (38) 

Observe that D is a bounded set since L has full rank. Moreover, a + D and a' + D are disjoint 
for distinct a, a' € L, and LlaeL{c( + D) covers all of except a set of Lcbcsgue measure zero 
(since only a Lcbcsgue measure zero of points in R^ are equidistant to two points in L). It follows 
that Vol(L') = dct(L) = det{jC{(f)))~^ , where Vol denotes Lebesgue measure, and that we have the 
following inversion formula. 

Fact 4.2 (Fourier inversion formula on lattices). 

P[X = A] = det{C{(p)) [ X{e)e-'^^'^^^'^Ud VA € £(</>). 
Jd 

Thus, our goal from now on is to understand the Fourier transform of X. We start with an 
explicit formula for X. 

Claim 4.3. We have 

Proof. By definition X = '}2beB '^b^'i^) j where Tf, G {0,1} are independent with F[Ti) = 1] = p. 
Thus 

= J]E[e2-^^''<<^W'^>] = J](l -p + pe2-^<<^W'^>). □ 

An important ingredient in controlling X is the following property. The terms {(j){b),9) which 
arise in the Fourier transform X{6) are tame in the following sense: if most of them are small, 
then all of them are small; and if most of them are close to integers, then all of them are close to 
integers. This is captured by the following lemma. The constant C2 appearing in the lemma is the 
one given in the boundedness assumption for V. 

Lemma 4.4. There exists a universal constant C > such that if we set 

M:=C{\A\\og{2c2\A\)f/^ (39) 

then for every 6 G M"^ ; 

m^ |(</)(6), e)\<M(^ Y.^m. o?\ = M\\e\\R. 

2. Write {(p{b),9) = m + r^, where rib e Z and r^ G [-1/2, 1/2). Then 

1/2 

/ I . — , „ \ 

max Irftl < M , , „ 
&eB ' ' - \\B 
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We note that the proof of the lemma uses only the boundedness assumption for V and the 
assumption that V has a transitive symmetry group and it is the only place where the symmetry 
assumption is used. We prove Lemma 14.41 in Subsection 14. 3i The main ingredient in its proof is 
the notion of local correctability of the map (f). 

Our next step is to approximate X near zero. The next lemma achieves this by approximating 
X(6) by its Taylor expansion at zero for 6 £ Bn^e). 

Lemma 4.5 (Estimating the Fourier transform near zero). For all < e < and 6 G Bji{e), 

X{9) = exp(27r? • {E[X],e) - 2tt'^ ■ O^XjO + 5(6)) 
where \6{e)\ = 0{M\\e\\%N). 

In this lemma as well as in the remainder of the paper we use the O(-) notation to hide universal 
constants, independent of all other parameters. We prove Lemma 14.51 in Subsection 14.41 We next 
derive an upper bound on the Fourier transform at points which are far from zero. Recalling that 
X{9) is an L-periodic function, such a bound can only hold for 6 bounded away from the points of 
L. We achieve this by requiring 6 to belong to D \ BR{e). 

Lemma 4.6 (Bounding the Fourier transform far from L). For all e > and 9 £ D \ Br{£), 

\X{9)\ < exp{-P^N) 

where /3 = /3(e) = min(e, ^). 

We prove Lemma 14.61 in Subsection 14.51 This lemma is the only place where we use the bound- 
edness assumption for V'^. 

4.3 Local correction 

A map tp : B ^ Z,^ is said to be locally correctable, if for any small subset E <Z B and any e £ E, 
we can express ip{e) as a short integer combination of {ip{b) : b £ B \ E}. This is an analog of the 
local correction property of codes, usually studied over finite fields. 

Definition 4.7 (Locally correctable). A map if) : B ^ l/" is called (5, s)-locally correctable if for 
any E C B of size 1 < < S\B\ and any e £ E, there exists a 7 G with WjWi < s such that 

V'(e) = J2 

b&B\E 

Regarding ip as a B x A matrix, we see that local correctability is actually a property of the 
space W spanned by the columns of ip and does not depend on the particular choice of basis given 
by ip. Still, it is convenient to define local correctability this way since our usage for it will be 
with a particular choice of basis. We say that a permutation vr G Sb is a symmetry of ip if it is a 
symmetry of W. We show that the assumptions that tp is bounded and has a transitive symmetry 
group imply that it is locally correctable. We use the notation HV'lloo := niaxag^^fegs |7/^(6)a|, 
Hi) ■■= EbeB for 7 G and ^(S) = Efee5 ^(^) fo^ ScB. 

Lemma 4.8. Let ip : B ^ Z"^ he such that ||V'||oo < c and such that the symmetry group of ip acts 
transitively on B. Then ip is {6, s)-locally correctable for some s = 0{\A\ log(2c|>l|)) and S = -g^. 

We need the following auxiliary claim. 
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Claim 4.9. Let if) : B ^ Z"^ he such that ||V'||oo ^ c. Then for any subset S <Z B of size 
\S\ > 0{\A\ log(2c|^|)) there exists a vector j G {—1,0, 1}"^ having at least j\S\ non-zero coordinates 
and satisfying ^"(7) = 0. 

Proof. The claim follows from the pigeon hole principle. Let a G (0, 1) be such that for any n > 1, 
the number of strings in {0, 1}" having less than |n ones is at most 2"". Fix S C B. For a subset 
S' C S, we have that is an |j4| -dimensional integer vector with entries bounded by c\S\ in 

absolute value and hence the total number of distinct values for it is bounded by (2c|5| + l)'"^'. 
The number of subsets of S is 2l'^L Hence, if 

2(i-a)|s| ^ (2c|5| + 1)1^1 (40) 

there must exist two distinct subsets Si, 5*2 such that ip{Si) = ip{S2) and |5i A ^2! > j\S\ (where 
we use A to denote symmetric difference). We then set 7 = — Igj and have that ipil) = s-i^d 
7 has at least j\S\ non-zero coordinates. Now, it is a simple exercise to verify that the condition 
\S\ > 0(1^1 log(2c|^|)) with a large enough hidden constant implies (j40]l . □ 

Proof of Leninia \4.8[ Let s = 0(|j4| log(2c|^|)) be an integer larger than the lower bound on |S| 
given by Claim Set 5 = Assume that 6\B\ > 1 since otherwise the claim holds vacuously. 
Fix a subset E C B of size 1 < l-E] < and an element e G E. For each b G B, let nb be a 
symmetry of ip satisfying TTb{b) = e. 

Choose S uniformly among the subsets of size s of and choose 69 uniformly in S. Let 
7 G { — 1,0,1}'^ be some (random) vector having at least j\S\ non-zero coordinates and satisfying 
ipij) = 0, whose existence is guaranteed by Claim [5]9j Define the events 

^1 ■■= Ho = 0}, 

^^2 := {36 G S \ {60} satisfying Tiboib) G E}. 

We first show that ^llr]ft2 has positive probability. Indeed, this follows from the fact that P[f2i] < | 
by the properties of 7 and, observing that conditioned on 60 each b G S'\{6o} is uniformly distributed 
mB\{bo}, 

Now, assume that ilf n occurred. We may assume WLOG that jbo = 1 (since otherwise we may 
replace 7 by —7). Thus 

^{bo) = - Yl ^^m- (41) 

b&S\{bo} 

Since TTb^ is a symmetry of ■i/'j there exists an invertible linear map r : Q"^ — t- such that 
i^Hoib)) = T{ip{b)) for all b G B. Applying r to (gl]) gives 

V'(e) = r(V(6o)) = - 76r(V(6)) = - "fbi^M))- 

beS\{bo} bes\{bQ} 

Finally, implies that ^Tb^^{b) ^ E for all 6 G S* \ {60} and the lemma follows. □ 

We now derive Lemma l4.4[ 
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Proof of Lemma \4-4\ We have by Lemma 14.81 that (j) is {5, s)-locany correctable with s = 
0{\A\ log(2c2|^|)) and 5 = j^. We will establish Lemma [Ol with M = s/V6. 

/ \l/2 

Let us first prove the first item. Let /3 := ( j^y X^f^g^ and set E := {b £ B : 

1(0(6), ^)| > -^}- Then we must have that \E\ < 6\B\. If E is empty we are done. Otherwise, since 
(j) is ((5, s)-locally correctable, for any e G E we can express (/)(e) as (/)(e) = J2b^E ' '^(^) where 
X] |7f)| < s. Hence in particular, 

me),e)\ = \Y,7tm),e)\ <Y^ht\ ■ mh).e)\ <s-^. 

b^E b(^E ^ 

The second item is very similar. Let /3 := Ylbi^B kbP) ^"^^ E := {b £ B : \rb\ > -^j. 
Again, \E\ < 6\B\ and thus any e £ E can be expressed as 0(e) = "^^^E^b • 0(6) with ||7||i < s. 
Hence 

me),e) =Y,^b{m,0) =Y,7b{nb + rb), 
b(^E b(^E 

and in particular = ^b^E'ybfb (mod 1) (where we mean that the modulo 1 maps M to 
[-1/2,1/2)). Hence 

Fel < \lb\\rb\ < s ■ -j=. □ 

b^E 

4.4 Estimating the Fourier transform near zero 

We prove Lemma 14.51 in this subsection. Let 9 € BR{e) C M^. Recall that by Claim I^THl we have 
that 

x{e) = W (l-p+p.e2-^W^).^)) . 

beB 

Let us shorthand Xb = 2iT{(f)(b),0). By our assumptions that 9 £ Br{£) and Lemma 14.41 we have 

maxlxfel = 27r max I (0(6), 6*) I < 27rM||6'||R < -, (42) 
beB beB 4 

where the last inequality follows from the assumption that ||^||_r < e < Define the function 
/ : R — )- C given by f{x) := 1 — p + pe^^ . Then 

^(.0) = n fi^b). (43) 

beB 

Claim 4.10. If < p < 1 and \x\ < j then f{x) = exp(ipx — ^p(l — p)x^ + 6{x)) where \5{x)\ = 
0{p\x\^). 

Proof. Let y = p{e^^ — 1) so that /(x) = 1 + y. Our assumptions imply that \y\ < |e*^ — 1| < 
\/2 - \/2 < 1 so that log(l + y) = y - y'^/2 + 0{\yf). Now, y = ipx - px'^/2 + 0{p\x\^) and 
= -p^x^ + 0(p^|xp). Hence 

log(/(x)) = ipx — -^p{^ — P)^^ + 0{p\x\^). □ 
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Applying Claim [¥.101 to each term in (j43p we obtain 

Xie) = exp(27ri • p ■ ^) - ■ ^p{l - p) ■ ^^(0(6), 9f + 5(9)) 

beB beB 

= exp(27ri • {E[X],9) - I-k'^ ■ 9^Y\X\9 + b(9)) (44) 

where we recall that E[X] = pJ2beB ^i^) ^^^^ ^[X]a,a' = p(l — p) J2beB 't'{b)a'Pib)a' , for a, a' G 
A. The error term is bounded by 

m\ = o{pY,mb),9)f). (45) 

beB 

Applying (ji2]l again yields 

^ K</.(6),0)|=^ < maxK0(6),0)| • ^ K0(6),e)P < M\\9\\r ■ \\9\\%\B\ = M\\9\\%\B\, 
fees fees 

and since p|-B| = N we can bound the error term by 

|5(0)| < O(M||0|||iV). 

4.5 Bounding the Fourier transform far from L 

We next bound the Fourier transform far from in the i?-norm, proving Lemma [4. 61 Fix e > and 
9 € D\Bn{e). Our goal is to show that |^(^)| must be small. Let us decompose {(p{b), 9) = rih + rh 
where rif, G Z and G [—1/2, 1/2). Recall that by Claim HT3l we have that 

X{e) =ll{l-p + p- e2-^<'^('')'^)) = Yl{l-p + p- e^"^-''*') . (46) 
fees fees 

We need two auxiliary claims. 

Claim 4.11. |X(6')| < exp(— ^^^^ r^) < exp(— max^gs r^), where M is defined in (f39|) . 
Proof. It is simple to verify that for any |x| < 1/2 and p < 1/2, 

|1 -p + pe^''*''! < exp(-px^). 

Hence 

|X(0)|<exp(-p^r2)=exp(-^^r2). 

fees ' ' fees 

The second inequality now follows from the second part of Lemma 14. 4i □ 

Recall that C3 is the constant from our assumption that V'^ has a bounded integer basis in ii. 
We next argue that if max;,e_B kfe| < ^ then the vector {nb)beB belongs to the space V. 

Claim 4.12. //max^g^ |rf,| < I/C3 then there exists a G Q"^ such that {(j){b),a) = for all h ^ B. 

We remark that this is the only place in our proof where the assumption that V-^ has a bounded 
integer basis in £1 is used. 
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Proof of Claim \4^T^ Assume to the contrary that no such a exists. Then (n;,)6gB does not belong 



to V and hence it must violate some constraint of V . However, by assumption, V is spanned by 
integer vectors of li norm at most C3. Hence, there exists 7 G Z-^, ||7||i < C3 such that 

XI ^bUb / 0. 

Since both 7 and (n;,) are integer vectors, we must have that 

I ^Tfe^-bl > 1- 

beB 

However, we know that the vector [ub + rb)beB belongs to V (more precisely, to the span over M of 
the vectors in V). Hence 

'^ibim + n) = 0. 

beB 

Thus we conclude that 

beB 

This is, however, impossible if \rb\ < I/C3 for all 6 € i?. □ 

We now conclude the proof of Lemma 14.61 There are two cases to consider. Suppose first that 
maxftg^ \rb\ > —■ Then Claim IT. 1 1 1 implies that 

Now assume instead that maxftgg \rb\ < and let a G Q"^ be as in Claim l4TT2l By definition, a 
belongs to the lattice L. It follows that 

' ' beB ' ' beB 

Thus, by the definition (|38|) of the Voronoi cell D and the fact that 6 £ D and a G L, we deduce 
that 

llfll|2 ^ ^ \ ^ „2 



' ' beB 



Since 9 ^ 13R{e) we also have ||^||_r > e and thus Claim HTTT] shows that 



\X(e)\ < exp(-iVe2) if maxlrbl < — . (48) 

beB C3 



Taken together, (j47p and ()48p prove Lemma 
4.6 Proof of central limit theorem from auxiliary lemmas 

In this section we prove Theorem 14. II We start with a bound on the in-radius of D in the i?-norm. 
Claim 4.13. If £ < ^ then Br^e) C D. 
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Proof. Let ^ a £ L. By definition, {(f)(b),a) € Z for all b £ B. Since C{(f)) is of full rank and 
a 7^ 0, there exists some b £ B for which \ {(l){b), a)\ > 1. It follows from Lemma [4.4l that \\a\\ji > jj. 
Since o. is arbitrary, we deduce from the definition (|38p of that Bji^e) C for any £ < 

Now fix A E ^{(p) and recall from Fact 14.21 that 

P[X = A] = det(£(</.)) / l(0)e-2'^*<^'^>d0 VA G £(0). (49) 
Jd 

Introduce a second random vector Y € M"^ having the Gaussian distribution with mean K[X] and 
covariance matrix (that is, with the same mean and covariance matrix as X). Recall that the 

density function fy of Y equals 

exp(-l(x - K[X]YnX]-Hx - E[X])) 

frix) := > (50) 

(27r) — v^detS[X] 

and that the Fourier transform of Y equals 

9(6) := E[e2''*<'^'^>] = e'^^mx],e)-'i^''e^^x]e _ ^^^-^ 

Moreover, the Fourier inversion formula applied to Y yields 

fY{x)= [ y(0)e-2'^*<^'^>cie VxGM^. (52) 
Jra 

Theorem 14.11 will follow by showing that ¥[X = A] approximately equals det(>C((/)))/y (A). Fix 
< e < 2^ whose exact value will be chosen later (see ([58ll and (f59]) ). Combining (j49]) . ([52]) and 
Claim 14.131 we may write 

|P[X = A]-det(£(0))/y(A)| < 

det(£((^))( / \x{e)-Y{e)\de+ f \x{e)\de+ f \Y{e)\de\ (53) 

^JBR(e) JD\BR{e) JRA\BR{e) ' 

V ' V ' V ' 

= :/l =:Il =:h 

Our next lemma provides upper bounds on each of the above integrals. 
Lemma 4.14. There exists a universal constant c > such that: 
1- If £<jM '^^^ Me^N < c then 

M|^|3/2 



h < 



2. Ife<^ then 



3. If£^N>^ then 



2\/iV(27r)^y^det(S[X]) 



det (£((/.)) ■ 



h < ■ 



(27r)-Vdet(S[X]) 
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Proof. We start with the second item. By Lemma 14.61 if e < and ^ D \ Br{£) then 
|X(^)| < exp(-e2iV). Hence, 

/.<V„l(I>)e-'" = j^. 
We continue with the third item. By (jSip we have 

h = I e-2-'^'^I^]^de. (54) 

To evaluate the integral let G be a standard multivariate Gaussian random vector in (with 
mean zero and identity covariance matrix). Recalling that is a positive definite matrix, let 

be a symmetric positive definite matrix such that (S[X]~^/^)-^ = It follows that 

z := —nxr'^i^G 

27r ^ ^ 

has a Gaussian distribution with mean zero and covariance matrix Thus the density 

function of Z is 

f,^e) = (2vr)^7diIMe-'-''*^[^]'. (55) 



Comparing with (154p yields 

I3 = (27r)-^ det(S)-^P[||Z||ij > e] < (2^)"^ det(S)-5F[||G||| > 2T:'^e^N], (56) 
where in the last inequality we used that 

recalling that p\B\ = N and our standing assumption that p < ^. Now, The distribution of ||G||2 
is chi-squared with |^| degrees of freedom. Observing that Ee^H'^Ha = (l — 2i)~l^l/^ for t < 1/2 and 
fixing t = 1/4, Markov's inequality yields for any p > 41^41 that 

F[\\Gg >p]< = 2l^l/2e--/^ < e-''/^. 

Applying this result to (j56p and using our assumption that e^N > we have 

h < rr, . 



(27r) — Y^det(S) 

Lastly, we verify the first item. Using Lemma 14.51 and ()5ip we have 



'BR(e) 

where \5{9)\ = O(M||0||jjA^). Our assumption that Me^N < c for a sufficiently small c implies that 
\e^{9) _ i| < 2M||6l|||A^ for 9 G BR{e). Thus 

h < 2MN [ e-2-'^^^[^l^||0|||d0 < 2MN [ e-^-'''^^''^'\\9\\ld9 = 



(27r)^v/dit(S) (27r2)3/2^(27r)^y/d^t(S) 
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where we used again the relations (j55p and (|57p . Finally, observing that by Jensen's inequality, 
HWGWl] < = {3\A\ + 1^1(1^1 - 1))^/^ < 43/4|A|3/2 and that 0^ < \ finishes the 

proof. □ 

We make the choice 



2|^|logiV 

e:=\l ^ (58) 

and the assumption 

N>C' ■C2cl\Af\og{2cM\f (59) 
for some universal constant C" > chosen sufficiently large for the following calculations. It is 
simple to check that with these choices the assumption e < as well as all the assumptions in 
the items of Lemma 14.141 hold. Thus, ()50p . ()53p and Lemma 14.141 imply 



p[X = A] - ('^('^^^ g-l(A-E[X])*S[X]-i(A-E[X]) 

(2^)^^detS[X] 



< 



^ det(£(</.)) / M|yl|3/2 ^ (27r)^ Vdet(S[X]) ^„,2^ ^ g-i-'^'^^ 

" (2^)^Vdet(S[X]) V det(£(0)) | 

To compare the middle summand in the right-hand side with the others we use the following crude 
bounds on det(£((/))) and det(S[X]). First, det(£(0)) > 1 since £(0) C Z^. Second, since 

= p(l -V)Y. ^(b)aHb)a' < clN, 

beB 

by Hadamard's inequality, we deduce that det(S[X]) < c^^'^ N\'^\\A\\^\/'^ . Thus, ([58]) and ([MD imply 
(27r)^v/det(S[X])^„,.^^ (2^ci iV)^ |yL| ^ ^ 1 



det(£(,^)) - iV2|^l - 

1 

det(£((/.)) 



if the constant C" in ([59]) is large enough. Since also e <^ ^ < i -*^. ,^ we finally conclude that 



p[X = A] = ^^^^ ^W) / -i(A-E[X])*S[X]-(A-E[X]) ^ A (go) 

(27r) — ydetS[Xj ^ ^ 

where |(5| < . Recalling that A is an arbitrary point in £(0) and = p|-B|, we see that we 

have proven Theorem 14.11 in the case p < ^. 

We now get rid of the assumption p < Fix p > ^, let N := p\B\ and assume that 

\B\-N>C'- C24\Af log(2c3|^|)^ (61) 

holds. Recall that X = ^^g^ Th(p{b) with the {T^} independent, identically distributed and satis- 
fying P[Tfe = 1] = 1 — F[Th = 0] = p. Let us temporarily write Pp, Ep and Sp[X] for the probability, 
expectation and covariance matrix of X with a given p. Denote (j){B) := Y^beB 'Pi^)- fact that 
X = A if and only 'Y^bi^B^^ ~ '^b)4'ib) = ^(-B) — A implies that for any A G £(0), by ([60]) . we have 

Fp[X = A] = Pi_p[X = cPiB) - A] = 

^ det{£{(j))) / -i(0(B)-A-Ei_p[X])*Si_p[X]-i(</,(iJ)-A-Ei_p[X]) 

(27r)^V^detSi_p[X] ^ 

^ det(£((/')) / -|(A-Ep[X])*Ep[X]-i(A-Ep[X]) 

(27r)^y^detSp[X] ^ 
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with \5\ < .Ml^J^^ ag required. In the last equality we used the facts that $]i_p[X] = 

■^|_B|-Af 

Ei_p[X] = (j){B) — Ep[X] and that f/D^ = {—iifD{—ij) for any vector ^ and matrix D. This 
establishes Theorem 14.11 in full. 

4.7 Proof of main theorems 

We now proceed to deduce Theorems 12.41 and 12 . 51 from the local central limit theorem, Theorem l4.1[ 
Assume the conditions of Theorem 12.41 and let be an integer satisfying condition (jlip . We wish 
to estimate the number of subsets T d B ol size N satisfying 

W\^^^^^ = W\^^^^^ for all/ in l^. (62) 

Define P '■= j^- Let {ipa ■ B Q}aeA be a basis for V. Define the random subset T C B and 
random vector X G Q'^ as in Theorem 14.11 Since 



the event X = E,[X] means 



x = Y,Ht) 



t^T heB ' ' beB 



Thus, since {(/>a}, a S A, is a basis for V , the event X = E[X] is equivalent to 

E/(*) = M^^^^) for all/ in y. (63) 

t&T ' ' beB 

Now, by assumption, the constant function h = 1 belongs to V. Thus, on the event X = E,[X] we 
have 

l^l = E^W = ^E^(^)=^- (64) 
teT ' ' fees 

Comparing (I62p . (|63p and (|64p we see that the event X = K[X] is equivalent to the event that 
|T| = N and (j62p holds. Now, denoting by the number of subsets T d B oi size for which 
(|6^ holds it follows that 

P[A = E[A]] =ajvp^(l-p)l^l-^. (65) 
Finally, the divisibility assumption implies that 

' ' beB 

Thus we may substitute A = E[X] in Theorem 14. II to obtain 

P[A = E[A]] = ^J(^M^ (1 + 5), (66) 

(2^) — ydetS[A] 

with \5\ < '^'^'™(^) (iog(2c2 dim(v))) / ^ Comparing (1651) and (1661) proves the assertion of Theorem l2.51 

' ' - ^ymm{N,\B\-N) ^ ^ ^ ^ ^ ^ ^ 

Lastly, Theorem 12.41 follows upon noting that the assumption (|lip implies that \6\ < |, so that 
F[X = E[X]] > and hence a^v > 0. 
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4.8 Basis-free formulation of local central limit theorem 



In this section we describe an equivalent "basis-free" version of our local central limit theorem, 
Theorem 14. 1[ The theorem is a high-dimensional, lattice, local central limit theorem with a rate of 
convergence estimate involving only universal constants. 

Recall the parameter p{V) of the vector space V introduced in (jl3p . We introduce a second 
parameter of V, a non-negative definite form {■,-)v on Q^. As is the case for p{V), it is easiest 
to define (•, •)y via a choice of basis for V but we stress that it is independent of this choice. If 
(f) : B ^ is such that the vectors (</>a), a G A, form a basis for V, we define 

(71,72)1/ := 7^0(</'V)"VS (71,72 G Q^). 

In this definition, (p is regarded as a, B x A matrix with columns {</>a}- The matrix 
represents the orthogonal projection operator from (with the standard basis and inner product) 
to V. We denote the semi- norm induced from (•, •)y by || • \\v, 



nWv ■-- 



V(7,7)y (7 G Q""), 



so that ||7||y is the length of the orthogonal projection of 7 to V. Finally, we denote by 1 the 
identically one vector in Q^. 

Theorem 4.15 (Basis-free formulation of local central limit theorem). There exists a constant 
C > such that the following is true. Let B be a finite set and let V be a linear subspace of 
functions / : -B — t- Q. Assume that the following conditions hold for some integers C2, C3 > 1, 

1. Boundedness ofV: V has a C2-bounded integer basis in ic>o- 

2. Boundedness ofV'^: has a c^-bounded integer basis in £1. 

3. Symmetry: for any ^1,62 G B there exists a symmetry nofV satisfying 7r(&i) = 62. 

Let < p < 1 and form a random subset T <Z B by taking each element of B into T independently 
with probability p. If 

min(p|5|,(l -p)|S|) > C-C2cidim(y)6log(2c3dim(y))6 

then for every 7 G the probability of the event 

T.f(^^ = T.^bf{b) forallfmV (67) 



equals 



(2^p(l-p))^ 



2 



with |5(7)| < gdim(y)3(log(2c2dim(y)))3/ 

It is important to emphasize that one must take 7 £ Z^, rather than 7 G Q-^ in the theorem, 
analogously to the restriction that A S >C((/)) in Theorem 14.11 However, since V typically has 
dimension strictly less than \B\, it is possible that for some 7 G \ Z-^ there exists another 
7' G such that 

Y.^hf{h) = Y.i,f{b) for all/ in 1^. (69) 

b&B beB 
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Indeed, this is exactly the scenario we face in our main theorems. There, we are interested in 
the case that 7fe = for all 5 G B, a vector which is not in Z^. The divisibihty condition in 

Theorem 12.41 exactly ensures that for this vector there exists some 7' G Z'^ such that (I69p holds. 

Let us say that a vector 7 G has an integer representation by 7' G if (|69p holds. It is not 
difficult to check that in this case II7— p-l||y = II7'— p-l||y. Thus, our theorem remains true as 
stated if the restriction that 7 G Z-^ is replaced by the condition that 7 G and has an integer 
representation. Moreover, it is evident that if 7 has no integer representation then the probability 
of (|67p is zero, since the left hand side of (|67p exactly provides an integer representation for 7. 

We now briefly explain the equivalence of Theorem 14.11 and Theorem 14.151 Suppose that {4>a}, 
a G A, form a basis for V, define (j) : B ^ by (l){h)a = 4>a{B) and regard (/> as a i? x A matrix. 
If A G >C((/>) then, by definition, there exists some 7 G Z-^ such that 

A = j;76'/>(&) = 0S- (70) 

beB 

It is then straightforward to check that for a subset T <Z B condition (j67p is equivalent to 

^0(t) = A. (71) 

Conversely, given 7 G Z^ we may define A G C{4)) by (fTOj) and observe again that ([67|) is equivalent 
to ()7ip . Thus, to see the equivalence of the two theorems, it suffices to show that the main terms 
in the probability estimates ()36p and ()68p are equal when the relation ()70p holds. This follows from 
the definitions of /o(y), the definitions of E[X] and in Subsection ST] and the observation that 
under ([701) we have 

||7 - P • = (7 - P • 1)V('AV)~ V*(7 - P • 1) = 

= (A - E[X])*(<^V)-'(A - m]) = P{1 - p){\ - E[X])*S[X]-i(A - E[X]). 



5 Summary and open problems 

Our main theorem guarantees the existence of a small subset T C B for which ([1]) holds. The 
conditions we require are boundedness, divisibility and symmetry. In many natural scenarios it 
is easy to guarantee that V has a bounded integer basis in 5 the divisibility and the symmetry 
condition, and the condition which seems hardest to verify is that V-^ has a bounded integer basis 
in ii. In particular, the following question captures much of the difficulty. Let G be a group that 
acts transitively on a set X. A subset T C G is X -uniform (or an X -design) if it acts on X exactly 
as G does. That is, for any x,y £ X, 

G T : gix) = y}\ = j^^\{g G G : g{x) = y}\ = ^. 

In our language we may take B = G and V to be the space spanned by all functions (f'ix,!/) '■ B — t- 
{0, 1} of the form 4>{^x,y){b) = '^{b{x)=y} fo^^ x,y £ X. Then T is X-uniform if and only if ([T]) holds. 
We have given a bounded integer basis for V in -^QQ , and also by definition the symmetry condition 
holds. The other conditions are less clear. One may still speculate that: 

Conjecture 5.1. Let G be a group that acts transitively on a set X. Then there exists an X- 
uniform subset T C G such that \T\ < for some universal constant c > 0. 
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A second question is whether one can apply our techniques to get minimal objects. Recall that 
the size of the objects we achieve is only minimal up to polynomial factors. For example, one of the 
main open problems in design theory is whether there exists a Steiner system (i.e. a t-design with 
A = 1) for any t > 5. Another major open problem of a similar spirit is the existence of Hadamard 
matrices of all orders n = 4m, or equivalently, 2-(4m — 1, 2m — 1, m — 1) designs. Empirical estimates 
for n < 32 suggest that there are exp(0(nlogn)) Hadamard matrices of order n = 4m. Since there 
are so many of them, and since the logarithm of their number grows at a regular rate, we suspect 
that they exist for some purely statistical reason. However, the Gaussian local limit model seems 
to be false for Hadamard matrices interpreted as t-designs; it does not accurately estimate how 
many there are. 

A third question is whether there exists an algorithmic version of our work, similar to the 
algorithmic Moser |Mos09] and Moser-Tardos |MT10| versions of the Lovasz local lemma |EL75j . 
and the algorithmic Bansal [BanlO] and Lovett-Meka [LM12| versions of the six standard deviations 
method of Spencer |Spe85| . If an efficient randomized algorithm of our method were found, then 
we could no longer indisputably claim that we have a low-probability version of the probabilistic 
method. On the other hand it would be strange, from the viewpoint of computational complexity 
theory, if low-probability existence can always be converted to high-probability existence. Maybe 
our construction is fundamentally a low-probability construction. 

It is also of interest to extend our results to continuous setups, with one representative example 
being that of spherical designs (see, e.g., [BRVlOj and references within). We aim to develop this 
direction in future work. 

Acknowledgements. We thank David Soudry and Gady Kozma for useful remarks on the 
representation theory of the symmetric group. 
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